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Abstract 



> 

L-. ' We extend our previous results on local asymptotic normality (LAN) for qubits [181 I15j to 

f^ ' quantum systems of arbitrary finite dimension d. LAN means that the quantum statistical 

— ^ , model consisting of n identically prepared d-dimensional systems with joint state p®" converges 

f^ ' as n ^ cxD to a statistical model consisting of classical and quantum Gaussian variables with 

Qy ■ fixed and known covariance matrix, and unknown means related to the parameters of the density 

matrix p. Remarkably, the limit model splits into a product of a classical Gaussian with mean 

equal to the diagonal parameters, and independent harmonic oscillators prepared in thermal 
^^ . equilibrium states displaced by an amount proportional to the off-diagonal elements. 

JH , As in the qubits case [15] , LAN is the main ingredient in devising a general two step adaptive 

procedure for the optimal estimation of completely unknown d-dimensional quantum states. 

This measurement strategy shall be described in a forthcoming paper |17] . 
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1 Introduction 

Quantum statistics deals with problems of statistical inference arising in quantum mechanics. The 
first significant results in this area appeared in the seventies and tackled issues such as quantum 
Cramer-Rao bounds for unbiased estimators, optimal estimation for families of states possessing 
a group symmetry, estimation of Gaussian states, optimal discrimination between non-commuting 
states. It is impossible to list all contributions but the following references may give the flavour of 
these developments [13 Hi SZl [3 H [Mj ■ The more recent theoretical advances [^ [^ [55 1 [51 [^ [5] 
are closely related to the rapid development of quantum information and quantum engineering, and 
are often accompanied by practical implementations [H [HI [40l [39] . 

An important topic in quantum statistics is that of optimal estimation of an unknown state using 
the results of measurements performed on n identically prepared quantum systems [32l [9l [45l [131 
[30l m [25l [211 HI [12] . In the case of two dimensional systems, or qubits, the problem has been solved 
explicitly in the Bayesian set-up, in the particular case of an invariant prior and figure of merit 
based on the fidelity distance between states [5]. However the method used there does not work for 
more general priors, loss functions, or higher dimensions. In the pointwise approach, Hayashi and 
Matsumoto [25] showed that the Holevo bound [28] for the variance of locally unbiased estimators 
can be achieved asymptotically, and provided a sequence of measurements with this property. Their 
results, building on earlier work [3T1 HD], indicate for the first time the emergence of a Gaussian 
limit in the problem of optimal state estimation for qubits. The extension to d-dimensional case is 
analysed by Matsumoto in [33] . 

In [181I15J we performed a detailed analysis of this phenomenon (again for qubits), and showed that 
we deal with the quantum generalization of an important concept in mathematical statistics called 
local asymptotic normality. As a corollary, we devised a two steps adaptive measurement strategy 
for state estimation which is asymptotically optimal for a large class of loss functions and priors, and 
could be practically implemented using continuous-time measurements. In 'classical statistics', the 
idea of approximating a sequence of statistical models by a family of Gaussian distributions appeared 
in [46] , and was fully developed by Le Cam [31] who coined the term "local asymptotic normality" . 
Among the many applications we mention its role in asymptotic optimality theory and in proving 
the asymptotic normality of certain estimators such as the maximum likelihood estimator. The aim 
of this paper is to extend the results of [181 115] to systems of arbitrary dimension d < oo, and thus 
provide the main tool for solving the open problem of optimal state estimation for d-dimensional 
quantum systems [17) . 

Before stating the main result of the paper we will explain briefly the meaning of local asymptotic 
normality for two dimensional systems [181 115] . We are given n qubits identically prepared in 
an unknown state p. Asymptotic normality means that for large n we can encode the statistical 
information contained in the state p**" into a Gaussian model consisting of a classical random 
variable with distribution N(u, I^^), and a quantum harmonic oscillator prepared in a (Gaussian) 
displaced thermal state $1^. The term local refers to how p is related to the parameters 9 — {u, Q, 
as explained below. 



For a more precise formulation let us parametrise the qubit states by their Bloch vectors p{r) — 
i(l + l^~a) where ~a — (era;, fy, ^z) are the Pauli matrices. The neighborhood of the state po with 
To — (0, 0, 2/i — 1) and 1/2 < /i < 1, is a three-dimensional ball parametrised by the deviation u e K 
of diagonal elements and ^ G C of the off-diagonal ones 

Pe^i ^'^/ ^ ^* ), e = {u,OeRxC. (f.l) 

V C I- n-u ) 

Note that po is to be considered fixed and known but otherwise arbitrary, and can be taken to be 
diagonal without any loss of generality. Consider now n identically prepared qubits whose individual 
states are in a neighborhood of po of size l/^/n, so that their joint state is pg := [pe/^] for some 
unknown 0. We would like to understand the structure of the family (statistical experiment) 

Qn:^{pl>:\\e\\<C}, (1.2) 

as a whole, more precisely what is its asymptotic behavior as n -^ oo ? 

For this we consider a quantum harmonic oscillator with position and momentum operators satisfying 
the commutation relations [Q,P] = il- We denote by {\n),n > 0} the eigenbasis of the number 
operator and define the thermal equilibrium state 



oo 



$ = (l-e-'3)^e-'^''3|fc)(fc|, e-^ = 



l-fi 



k=o ^ 

which has centered Gaussian distributions for both Q and P with variance l/(4/i — 2) > 1/2. We 
define a family of displaced thermal equilibrium states 

$C.^2?';($):=W^(C/y2;r^T)$M/(C/ 72^1^1)*, (1.3) 

where W{() :— exp(Ca* — (a) is the unitary displacement operator with C g C. Additionally we 
consider a classical Gaussian shift model consisting of the family of normal distributions N{u, /i(l — 
fi)) with unknown center u and fixed known variance. The classical-quantum statistical experiment 
to which we alluded above is 

n := {$^ :=: N{u, /i(l - fi)) ® $'^ : ||6l|| < C} (1.4) 

where the unknown parameters = (u, C) G R x C are the same as those of Q„. 

Theorem 1.1. U8\.\15f Let Q„ be the quantum statistical experiment (jl.2p and letTZ be the classical- 
quantum experiment (jl.4p . Then for each n there exist quantum channels (normalized completely 
positive maps) 

Sn ■■ Li(R)(»r(L2(M))^M((C^)®"), 
with T{L^(M.)) the trace- class operators, such that 

lim sup ||$e-T„(p^)||i=0, 

"^°° ||0||<C 

lim sup 11/9^ -S'„($e) 111 =0, 
"^°° lieiKc 



for an arbitrary constant C > 0. 

The theorem shows that from a statistical point of view the joint qubits states are asymptotically 
indistinguishable from the limit Gaussian system. At the first sight one might object that the local 
nature of the result prevents us from drawing any conclusions for the original model of a completely 
unknown state p. However this is not a limitation, but reflects the correct normalisation of the 
parameters with n — > oo. Indeed as n grows we have more information about the state which can 
be pinned down to a region of size slightly larger that l/^/n by performing rough measurements 
on a small proportion of the systems. After this 'localisation' step, we can use more sophisticated 
techniques to better estimate the state within the local neighborhood of the first step estimator, and 
it is here where we use the local asymptotic normality result. Indeed, since locally the states are 
uniformly close to displaced Gaussian states we can pull back the optimal (heterodyne) measurement 
for estimating the latter to get an asymptotically optimal measurement for the former. Based on 
this insight we have proposed a realistic measurement set-up for this purpose using an atom-field 
interaction and continuous measurements in the field [15]. 

This paper deals with the extension of the previous result to d-dimensional systems. Like in the 
two-dimensional case we parametrise the neighbourhood of a fixed (diagonal) state po by a vector 
u E K'^^^ of diagonal parameters and d{d— l)/2 complex parameters ( = {Q^k ■ j < k), one for each 
off-diagonal matrix element (cf. ()4.2|) and ()4.4I) ). We consider the same l/y^— scaling and look at 
the family 

Qn = { h/^] ^" : e = (^?, c) e e„ c R^-i ® c^i^'^y^} , 

where 0„ is a ball of local parameters whose size is allowed to grow slowly with n. 

As in the 2-dimcnsional case, the limit model is the product of a classical statistical model depending 
on the parameters u and a quantum model depending on C. Moreover the quantum part splits into 
a tensor product of displaced thermal states of quantum oscillators, one for each off-diagonal matrix 
element Q^k with j < k. Thus 

$^ = N{u, /;/) ® (g) $5;,'= , 9 = iu, c). 
]<k 

Here, Ip^ is the Fisher information matrix of the multinomial model with parameters (/ii, . . . , fid) 
described in Example 1 3. 5 [ and ^-'j^ is the displaced thermal equilibrium state defined in (|4.35p with 
inverse temperature /3 = ln(/ij//ifc). 



Theorem l4.3l is the main result of the paper and shows the convergence of Q„ to the Gaussian model 

7^„ = l^*^ : 6* e e„ C M'^^i (g) C'^(d~^)/A , 

in the spirit of Theorem 11.11 On the technical side, the uniform convergence holds over local 
neighbourhoods 0„ which are allowed to grow with n rather that being fixed balls. This is essential 
for constructing the two stage optimal measurement: first localise within a neighbourhood 0„, and 
then apply the optimal Gaussian measurement. The details of this construction are similar to the 
two dimensional case and will be given in a subsequent paper [[17,. 



Despite the similarity to the two dimensional case, the proof of the d-dimensional result has ad- 
ditional features which may be responsible for the fact that the optimal estimation problem has 
remained unsolved until now. The proof is based on the following observations: 

• the n systems space (C^)®" decomposes into a direct sum of irreducible representations of 
SU{d), each representation being labelled by a Young diagram A (cf. Theorem 14. ip : 

• the joint state pf?^ has the block diagonal form (|4.8p . the block weights A — > p^'" depend 
only on the diagonal parameters u and are closely related to the multinomial distribution of 
Example 13.51 This classical statistical model converges to the {d — l)-dimensional Gaussian 
shift model N{u, I p^^); 

• there exists an isometry Vx mapping basis vectors jm, A) of the irreducible representation Tix 
almost into number vectors |m) of the multimode Fock space, where m = {mj,k : j < k} is 
the collection of number eigenvalues for all oscillators. 

• given a typical A, the conditional block-state p^^ can be mapped with V\ into a multimode 
state which is close (in trace norm) to the Gaussian product state (>!)j<k^/j^ ■ This can be done 
uniformly over the typical diagrams whose normalised shapes have '^j \pn fluctuations around 
(/ii, /i2, . . . , /id), and over parameters Q € 0„. 

The first item is the well known Weyl duality which is extensively used in quantum statistics for 
i.i.d. states. The probability distribution of the second point has also been analysed the context of 
large deviations [30] for the estimation of the state eigenvalues. The third point shows that the basis 
jm, A) is almost orthogonal for indices m which are not too big. This basis is obtained by projecting 
tensors of the form /a := fa{\) ®---® fa{n) onto a subspace of (C'')'^" which is isomorphic to Hx (cf. 
Theorem l5.2p . Let us place the indices {a(i) : i = 1 . . .n} in the boxes of the diagram A along rows, 
starting from the left end of the first row, to obtain a tableau t^. It turns out that we only need 
to consider /a for which ia is a semistandard tableau (nondecreasing along rows, increasing along 
columns). Then the label m := {rriij : j > i} is the collection of integers mij equal to the number 
oi j's on the row i, and is in one to one correspondence with a. The following is an example of such 
semistandard tableau 

l | l | l | l | l | l | l|l|2|2|2|3|"3l 
im = 2 2 2 "2I 2|3 I , with toi,2 = 3, mi,3 = 2, m2,3 = 1- 

3I3I3I 

The relatively large number of i's in the row i is intentional, since it turns out that the 'relevant' 
vectors, i.e. those carrying the states p^^, have indices rUij small compared with the length of the 
rows (Ai w niii for typical representations A). More precisely, in section [7^ we prove the following 
quasi-orthogonality result which allows us to carry the block states over to the oscillator space: if 
m 7^ 1 and |1| < |m| < 71'' then 

I (m, A|l, A) I = o(„(9')-2)|m-i|/i2) ^ Q fQ^ ^ ^ 2/9. 

n — ►CO 

The proof of the fourth point involves a detailed analysis of the state p^^ through its coefficients 
in the basis |m. A) of Hx- When 6 = the state is diagonal and its coefficients approach uniformly 



those of the muhidimensional thermal state <I>° — ^j<ck^j,k as shown in Lemma 15751 The next step 
is to apply SU{d) rotations and obtain the states Px'"- In Lemmas 16.41 and 16.51 it is shown that the 
unitary operations Ad[U\{(^/y/n)] act on p_^'" in the same way as the displacement operator £)'• acts 
on the thermal state $'^. A remarkable fact is that in the limit the different off-diagonal parameters 
'separate' into a product of shift experiments for quantum oscillators, one for each off-diagonal index 
{j < k). This could be guessed from the Quantum Central Limit Theorem 14.41 which is related to 
the restriction of our result to 6 ^ 0. 

Due to the apparent intricacy of the main result, the paper is organised according to the 'onion 
peeling' principle. We start in section [2] with general classical statistical notions which motivate our 
investigation in quantum statistics. In particular we explain the relevance of the Lc Cam distance 
between statistical models as a statistically meaningful way to describe convergence. Section [3] 
presents the classical version of local asymptotic normality with the multinomial model as example. 

In section |4] we introduce the quantum statistical model consisting of n identical quantum systems 
with joint state p^'" described by diagonal and rotation parameters. We also introduce the multimode 
Gaussian states appearing in the limit. With this we can formulate the main result. Theorem 14.31 

In section[5]we introduce the basis |m. A) and the isometry Vx allowing us to define the channels r„ 
and Sn connecting the two statistical models. 

In section [B] we break the proof of the main theorem into manageable lemmas, essentially by using 
triangle inequalities. Each lemma deals with a different aspect of the convergence and has an interest 
in its own. 



Finally, the technical proofs are collected in section [T] Notably, subsection 17.11 and Lemma 17.11 
contain the combinatorial substance of the paper. 

Our investigation relies on the theory of representations of SU{d). We refer to [TUIIIIIIII] for proofs 
of standard results and more details. 

As in the two-dimensional case [K] , local asymptotic normality provides a two stage adaptive mea- 
surement strategy which is asymptotically optimal for both Bayesian and pointwise viewpoints, and 
for a large range of 'distances' on the state space [17j . 



2 Classical and quantum statistical experiments 

In this section we introduce some basic notions from classical statistics with the aim of defining 
the Le Cam distance between statistical models and local asymptotic normality. In parallel, we 
will define the quantum analogues and point out their relevance in quantum statistics. The reader 
may find the conceptual framework helpful in understanding the quantum version of the result, but 
otherwise the section can be skipped at the first reading. 

Let X be a random variable with values in the measure space (A", E;^'), and let us assume that its 
probability distribution P belongs to some family {Pg : 9 G 0} where the parameter 9 is unknown. 
Statistical inference deals with the question of how to use the available data X in order to draw 



conclusions about some property of 9. We shall call the family 

£:={Pe:e&Q}, (2.1) 

a statistical experiment or statistical model over {X , Yjx) |31| . 

In quantum statistics the data is replaced by a quantum system prepared in a state (p which belongs 
to a family {(f)g : 6* e 6} of states over an algebra of observables. In order to make a statistical 
inference about 9 one first has to measure the system, and then apply statistical techniques to draw 
conclusions from the data consisting of the measurement outcomes. An important difference with 
the classical case is that the experimenter has the possibility to choose the measurement set-up M , 
and each set-up will lead to a different classical model {Pg : 9 £ Q}, where Pg is the distribution 
of outcomes when performing the measurement M on the system prepared in state (j)g . 

The guiding idea of this paper is to investigate the structure of the family of quantum states 

Q -.^ {cj)g : 9 e e}, 

which will be called a quantum statistical experiment. We shall show that in an important asymp- 
totic set-up, namely that of a large number of identically prepared systems, the joint state can be 
approximated by a multidimensional quantum Gaussian state, for all possible preparations of the 
individual systems. This will bring a drastic simplification in the problem of optimal estimation for 
d-dimensional quantum systems, which can then be solved in the asymptotic framework [TTj. 

2.1 Classical and quantum randomizations 

Any statistical decision (e.g. estimator, test) can be seen as data processing using a Markov kernel. 
Suppose we are given a random variable X taking values in {X,Y,x) and we want to produce a 
'decision' y ^ y based on the data X. The space y may be for example the parameter space Q 
in the case of estimation, or just the set {0, 1} in the case of testing between two hypotheses. For 
every value x £ X we choose y randomly with probability distribution given by Kx{dy). Assuming 
that K : X X Yiy -^ [0, 1] is measurable with respect to x for all fixed A E T,y, we can regard K as 
a map from probability distributions over {X,'E,x) to probability distributions over (J^, S3;) with 

K{P){A) = J K^{A)P{dx), AeJ^y. (2.2) 

A statistic S : X —> y is a particular example of such a procedure, where K^ is simply the delta 
measure at S{x). 

Besides statistical decisions, there is another important reason why one would like to apply such 
treatment to the data, namely to summarize it in a more convenient and informative way for fu- 
ture purposes as illustrated in the following simple example. Consider n independent identically 
distributed random variables Xi, . . . , Xn with values in {0, 1} and distribution Pg := (1 — 9, 9) with 
9 £& :— (0, 1). The associated statistical experiment is 

£n :- {Pe-0£ e}. 



It is easy to see that Xn = — X]r=i ^* ^^ ^'^ unbiased estimator of 6 and moreover it is a sufficient 
statistic for £„, i.e. the conditional distribution P^(-\Xn = x) does not depend on 0\ In other words 
the dependence on 6 of the total sample (Xi, X2, . . . , Xn) is completely captured by the statistic Xn 
which can be used as such for any statistical decision problem concerning £„ . If we denote by Pg 
the distribution of Xn then the experiment 

£n = W -OeQ}, 

is statistically equivalent to £„. To convince ourselves that X„ does contain the same statistical 
information as (^i, . . . , X^), we show that we can obtain the latter from the former by means of a 
randomized statistic. Indeed for every fixed value x of Xn there exists a measurable function 

/, : [0, 1] ^ {0, ir , 

such that the distribution of fxiU) is Pg{-\Xn = x). In other words 

MJs^{xi^---^Xn)) = Pg{xi,...,Xn\Xn =S), 

where A is the Lebesgue measure on [0,1]. Then F{Xn,U) :~ fx„{U), has distribution P^. To 
summarize, statistics, randomized statistics and Markov kernels, are ways to transform the available 
data for a specific purpose. The Markov kernel K defined in (|2.2p maps the experiment £ of equation 
(|2.ip into the experiment 

over (3^, S3;) with Qg = K{Pg). For mathematical convenience it is useful to represent such trans- 
formations in terms of linear maps between linear spaces. 

Definition 2.1. A positive linear map 

is called a stochastic operator or transition if ||T'*(<?)||i = \\g\\i for every g G L^{X). 
Definition 2.2. A positive linear map 

is called a Markov operator if Tl — 1, and if for any /« i m L°°{y) we have Tfn i 0. 

A pair (T* , T) as above is called a dual pair if 

J fT{g)dP = jT4f)gdQ, 

for all / G L^{X, T,x, P) and g G L°°{y, Y^y, Q). It is a theorem that for any stochastic operator T* 
there exists a unique dual Markov operator T and vice versa. 

What is the relation between Markov operators and Markov kernels ? Roughly speaking, any Markov 
kernel defines a Markov operator when we restrict to families of dominated probability measures. Let 

10 



us assume that all distributions Pg of the experiment £ defined in (|2.ip are absolutely continuous with 
respect to a fixed probability distribution P, such that there exist densities pe := dPg/dP : X -^ M+. 
Such an experiment is called dominated and in concrete situations this condition is usually satisfied. 
Let Kx{dy) be a Markov kernel (|2.2p such that Qg — K{Pg), then we define associated Markov 
operator {T{f)){x) :— j f{y)kx{dy) and have 

Qg^Pgo T, ye. (2.3) 



When the probability distributions of two experiments are related to each other as in (|2.3p . we 
say that J^ is a randomization of £. From the duality between T and T^ we obtain an equivalent 
characterization in terms of the stochastic operator T* : L^{X, Y.x, P) ~^ L^{y^ E^;, Q) such that 

T4dPg/dP)^dQg/dQ, we. 

The concept of randomization is weaker than that of Markov kernel transformation, but under the 
additional condition that {y, 'Sy) is locally compact space with countable base and Borel cr-field, it 
can be shown that any randomization can be implemented by a Markov kernel i 4lj . 

What is the analogue of randomizations in the quantum case ? In the language of operator algebras 
L°°{X, Ea", P) is a commutative von Neumann algebra and L^{X, 'Sx, P) is the space of (densities 
of) normal linear functionals on it. The stochastic operator T^ is the classical version of quantum 
channel, i.e. a completely positive normalized (trace-preserving) map 

where A*,B:^ are the spaces of normal states on the von Neumann algebra A and respectively B. 
Any normal state on ^ has a density p with respect to the trace such that 4'{A) = Tx{pA) for all 
A(^A. The dual of T, is 

T:B^A, 

which is a unital completely positive map and has the property that T^,{(j))(b) ~ (j){T{b)) for all 6 e ,S 
and (j) €i A*. We interpret such quantum channels as possible physical transformations from input 
to output states. 

A particular class of channels is that of measurements. In this case the input is the state of a 
quantum system described by an algebra A, and the output is a probability distribution over the 
space of outcomes {X, T,x)- Any measurement is described by a positive linear map 

M ■.l°°{x,j:x,p)~>a, 

which is completely specified by the image of characteristic functions of measurable sets, also called 
positive operator valued measure (POVM). This map M : Yix -^ A has following properties 

1. Positive: M{A) > 0, VA e E^- ; 

2. Countably additive: Y.7Li M{Ai) = Af (U,A,), A^ r\Aj^%,i^ j; 

3. Normalized: M{X) = 1. 

11 



The corresponding channel acting on states is a positive map M.^, : A■^, — > L^{X, Y^x, P) given by 

M{<f){A) = cj,{M{A)) = Tr(M/(^)), 

where p is the density matrix of (p. By applying the channel M to the quantum statistical experiment 
consisting of the family of states Q = {(j)g : 9 E Q) on A we obtain a classical statistical experiment 

Qm :- {M{M -Oee}, 

over the outcomes space {X,I]x)- 

As in the classical case, quantum channels can be seen as ways to compare quantum experiments. 
The first steps in this direction were made by Petz [351 ISHl IM] who developed the theory of quantum 
sufficiency dealing with the problem of characterizing when a sub-algebra of observables contains 
the same statistical information about a family of states, as the original algebra. More generally, 
two experiments Q := {A, 4>s : € Q} and TZ := {B, tpg : 9 £ Q} are called statistically equivalent if 
there exist channels T : A^ B and S : B —^ A such that 

ipg oT — (pg and 4'e o S = t/jg \f9. 

As consequence, for any measurement Af : L°°{X , Yjx, P) ^r A there exists a measurement T o M : 
L°°{X,T,x,P) -^ B such that the resulting classical experiments coincide Qm = T^ToM- Thus for 
any statistical problem, and any procedure concerning the experiment Q there exists a procedure 
for TZ with the same risk (average cost), and vice versa. 

2.2 The Le Cam distance and its statistical meaning 

We have seen that two experiments are statistically equivalent when they can be transformed into 
each other be means of quantum channels. When this cannot be done exactly, we would like to have 
a measure of how close the two experiments are when we allow any channel transformation. We 
define the deficiency of TZ with respect to Q as 

6{TZ,Q)^misupUe-ijgoT\\ (2.4) 

T g 

where the infimum is taken over all channels T : A-^ B. The norm distance between two states on 
A is defined as 

\\(j)i - (j)2\\ := sup{|0i(a) - 02(a)| : a e A, \\a\\ < 1}, 

and for A — B{Ti.) it is equal to \\pi — P2II1 '■— Tr(|pi — P2I), where pi is the density matrix of 
the state 0^. When 6{TZ, Q) — we say that TZ is more informative than Q. Note that S{TZ, Q) 
is not symmetric but satisfies a triangle inequality of the form S{TZ, Q) + d{Q,T) > 6{TZ,T). By 
symmetrizing we obtain a proper distance over the space of equivalence classes of experiments, called 
Le Cam's distance [31] 

A(Q, TZ) := max {6{Q, TZ) , 6{TZ, Q)) . (2.5) 

What is the statistical meaning of the Le Cam distance ? We shall show that if S{TZ, Q) < e then 
for any statistical decision problem with loss function between and 1, any measurement procedure 

12 



for Q can be matched by a measurement procedure for TZ whose risk will be at most e larger than 
the previous one. 

A decision problem is specified by a decision space {X , '^x) and a loss function We : A" — > [0, 1] for 
each 9 Cz 0. We are given a quantum system prepared in the state (jje £ A* with unknown parameter 
6 Cz Q and would like to perform a measurement with outcomes in X such that the expected value 
of the loss function Wg is small. Let 

M ■.l°°{x,j:x,p)-^a, 

be such a measurement, and Pg = (pe ° M , then the risk at 9 is 

R{M,9):= I We{x)P^^\dx). 
Jx 

Since the point 9 is unknown one would like to obtain a small risk over all possible realizations 

Rmax{M)^snpR{M,9). 
eee 

The minimax risk is then Rminmax '■= ^'^^^m Rmax{M). In the Bayesian framework one considers a 
prior distribution tt over 8 and then averages the risk with respect to tt 

R^{M) = [ R{M,9)Tr{d0). 
Je 

The optimal risk in this case is Rtt := infM Rtt{M). 

Coming back to the experiments Q and TZ we shall compare their achievable risks for a given decision 
problem as above. Consider the measurement N : L°"{X, I^x, P) ^ B given hy N — T o M where 
T : A^ B is the channel which achieves the infimum in (12.411. Then 



R{N,9) = / W{9,x)PJ,''\dx)^MToM{Wg)) 
Jx 

< II V'e o T - 0e|| + cbeiMiWe)) < S{n, Q) + R{M, 9), 
where we have used the fact that < Wg < 1. 

Lemma 2.3. For every achievable risk R{M, 9) for Q there exists a measurement N : L°°[X, T,x, P) - 
B for TZ such that 

R{N,9)<R{M,9)+5{TZ,Q). 

In consequence 

3 Local asymptotic normality in statistics 

In this section we describe the notion of local asymptotic normality and its significance in statistics 
[5T1 [m HU 133] • Suppose that we observe Xi, . . . , Xn where Xi take values in a measurable space 
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{X, Yix) and are are independent, identically distributed with distribution Pg indexed by a parameter 
9 belonging to an open subset 8 C W". The full sample is a single observation from the product 
Pg of n copies of Pe on the sample space (il",I]"). Local asymptotic normality means that for 
large n such statistical experiments can be approximated by Gaussian experiments after a suitable 
reparametrisation. Let 9o be a fixed point and define a local parameter u = y/n{9—9o) characterizing 
points in a small neighbourhood of ^Oi and rewrite Pg as Pg , , /— seen as a distribution depending 
on the parameter u. Local asymptotic normality means that for large n the experiments 



[Pl 



+u/^/n 



u e 



and 



{iV(u,V;):z.eM™} 



have the same statistical properties when the models 9 i-^ Pg are sufficiently 'smooth'. The point of 
this result is that while the original experiment may be difficult to analyse, the limit one is a tractable 
Gaussian shift experiment in which we observe a single sample from the normal distribution with 
unknown mean u and fixed variance matrix 17 . Here 

is the Fisher information matrix at 9q, with Ig^ :— dlogpg/d9i the score function and pg is the 
density of Pg with respect to a reference probability distribution P. 

There exist two formulations of the result depending on the notion of convergence which one uses. 
In this paper we only discuss the strong version based on convergence with respect to the Le Cam 
distance, and we refer to [13] for another formulation using the so called weak convergence (conver- 
gence in distribution of finite dimensional marginals of the likelihood ratio process), and to [16] for 
its generalization to quantum statistical experiments. 

Before formulating the theorem, we explain what sufficiently smooth means. The least restrictive 
condition is that pg is differentiable in quadratic mean, i.e. there exists a measurable function 
ig : X ^ M. such that as u ^ 



1/2 

Ps+u 



1/2 



1/2 



dP ^0. 



Note that ig must still be interpreted as score function since under some regularity conditions we 
have dpl'^/d9i = \{d\ogpg / d9i)py^ . 

Theorem 3.1. Let £ := {Pg ; e 6} be a statistical experiment with 8 C K'^ and Pg '^ P such 
that the map 9 ^ pg is differentiable in quadratic mean. Define 

£n = {Pl+^,^ ■■ \\u\\ < C}, T={N{u,Ig^') : Hull < C}, 

with Ig^^ the Fisher information matrix of £ at point 9q, and C a positive constant. Then A(£„, J-') —> 
0. In other words, there exist sequences of randomizations T„ and 5„ such that: 



lim sup 

"^°° ll"ll<c 

lim sup 

"^~ ll«ll<c 



^»(^."o+«/Vw)-^("'Vo') 



pn 



Z+u,VT.-Sn{N{uJg;)) 



= 0, 
= 0. 
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Remark 3.2. Note that the statement of the Theorem is not of Central Limit type which typicaUy 
involves convergence in distribution to a Gaussian distribution at a single point ^o- Local asymptotic 
normality states that the convergence is uniform around the point 6q, and moreover the variance of 
the limit Gaussian is fixed whereas the variance obtained from the Central Limit Theorem depends on 
the point 0. Additionally, the randomization transforming the data (Xi, . . . , Xn) into the Gaussian 
variable is the same for all 9 = Oq + u/ ^/n and thus does not require a priori the knowledge of 9. 

Remark 3.3. Local asymptotic normality is the basis of many important results in asymptotic 
optimality theory and explains the asymptotic normality of certain estimators such as the maximum 
likelihood estimator. The quantum version introduced in the next section plays a similar role for the 
case of quantum statistical model. An asymptotically optimal estimation strategy based on local 
asymptotic normality was derived in [15j for two-dimensional systems. 

Remark 3.4. Let us define the real Hilbert space L?'{9q) = (R™, (•, ■)gg) with inner product 

(u, u)eo = u^hoV- 

By multiplying with Ig^ we see that limit experiment can be equivalently chosen to be N^Ig^u, Ibq)- 
The characteristic function oi X ^^ N{IggU, Ig^) is 

Fuiw) := EeJexp(iw^X)] = exp f --||u;||2^ + i(w, u)ej . (3.1) 

A similar expression will be encountered in section [4] for the case of quantum Gaussian shift experi- 
ment. 

Example 3.5. Let P^ — (fii, . . . , fj.d) be a probability distribution with unknown parameters {fii, . . . , jJ-d-i) G 



_l_ satisfying fii > and J2i<d-i Mi < 1- T'he Fisher information at a point jjl i 



IS 



d-l d-l d-l 

Ht^h ^J2^"'^^^''^'^^ ' '^^^^7') + (^ - II /-*')"' = '^y^«^' + (^ - 11^^')"'' (3-2) 

fc=i 1=1 1=1 

and its inverse is 

VifJ.)lj ■■= [/(/i)"^]y = Sijfl., - fiiflj. (3.3) 

Thus the limit experiment in this case is T :— {N{u, V^(/i)) : u G M , ||u|| < C). 



This experiment will appear again in Theorem 14. 3[ as the classical part of the limit Gaussian shift 
experiment. 

4 Local asymptotic normality in quantum statistics 

In this section we present the main result of the paper. Local asymptotic normality for d-dimensional 
quantum systems means roughly the following: the sequence Q„ of experiments consisting of joint 
states p**" of n identical quantum systems prepared independently in the same state p, converges 
to a limit experiment TZ which is a quantum-classical Gaussian model involving displaced thermal 
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equilibrium states oi d{d— l)/2 oscillators and a, (d— l)-dimensional classical Gaussian shift model. 
As in the classical case, the result has a local nature reflecting the l/\/n rate of convergence of 
state estimation. A neighbourhood of a fixed diagonal state po = Diag(/xi, . . . , /i^) is parametrised 
by (changes in the) diagonal parameters u € M.'^^^ and off-diagonal parameters ( £ (!2d{d~i)/2^ rpj^^ 
latter can be implemented by small unitary rotations. The limit Gaussian model has a classical 
part N[u,V(ii)) with fixed known variance V{fj,), and a quantum part 



^9i!° with each $ 



yi<fc'^j> 






being a thermal equilibrium state with /3j_fc = ln(/Zj///fe), displaced in phase space by an amount 
proportional to Q^k- 

The reason for choosing the above parametrisation is twofold. Firstly, it unveils the important 
separation between 'classical' and 'quantum' parameters, and the further separation among the 
different off-diagonal parameters. Secondly, it is very convenient for the proof. However as we will 
see in ???, the limit experiment can be formulated in a 'coordinate-free' way in terms of quasifree 
states on CCi?-algebra. Although it is not needed in the main theorem, we include this formulation 
linking our result to the Quantum Central Limit Theorem. We stress again that local asymptotic 
normality is not a consequence of the Central Limit Theorem, indeed the latter is not even an 
ingredient in the proof but gives an indication as to what is the limit state when all parameters are 
zero. 

4.1 The n-tuple of rf-dimensional systems 



As explained in section [3] for the classical case, our theory will be local in nature, so we will be 
interested in a (shrinking) neighbourhood of an arbitrary but fixed faithful state 



PO = 



f^l 













^^2 















fj.d 



with /ii > /i2 > • ■ ■ > Md > Oj 



(4.1) 



which for technical reasons is chosen to have different eigenvalues. A sufficiently small neighbourhood 
of Po in the state space can be parametrised by 6* := (u, Q as follows 



Hi + Ui Cl,2 

Cl,2 M2 + M2 



C* 



Cd-l,d 
Cd~l,d IJ-d — '}2,i=l ^i 



u^ e M, Cj,fc e 



(4.2) 



Indeed, note that if 9 is small enough then pg is a density matrix. 

Let 5 := infi<j<d/ii — P-i+i, with p,d+i — 0, be the separation between the eigenvalues. In the first 
order in 9 /v5, the family pe is obtained by first perturbing the diagonal elements of po with u and 
then performing a small unitary transformation with 



f/(C) := exp 



E 



yl<j<k<d 



Re{Cj^k)Tj,k + Im(Cj\fc)Tfc j 
VMj - Pk 



(4.3) 
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where Tj^ are generators of the Lie algebra of SU{d) defined in (17. 2p . The advantage of the latter 
parametrisation is that we can fully exploit the machinery of irreducible group representations. For 
this reason, in all subsequent computations we will work with the 'unitary' family 



Pe 



uiC) 



fll +Ul 







^J■2 +U2 










C/*(C), u,€R, Q,k€C. 



(4.4) 



but we keep in mind the relationship with (|4.2p . 

As in the classical case, the parameter 9 will be scaled by the factor l/\/n meaning that we zoom in 
around po with the rate equal to the typical estimation rate based on n samples. Let p^'" :— Pg?/- 
and let Q„ be the sequence of statistical experiments 

Q„ := {/■" : e e e„} , (4.5) 

consisting of n systems, each one prepared in a state Pg/^ situated in a local neighborhood of po- 
The local parameter 9 — {~u , C ) belongs to a neighborhood 0„ of the origin of W^^^ x C'*'^'*^^^/^ 
which is allowed to grow slowly with n in a way that will be made precise later. 

One of the principal tools in our result is the representation theory of the special unitary group 
SU{d). Due to lack of space we shall not include any proofs and refer to [10l[T4l[Tl] for details. In 
particular we will be working with the well known tensor representation which will be analysed in 
increasing depth across the following sections. 



The space (C'*)'*" carries two commuting group representations: that of SU{d) given by 
^n{U) : iVi) ®---® Hn) ^ U\iPi) ® • • • ® C/|V'„), U £ SUid), 



and that of the permutation group S{n) given by 



(g) \iJr- 



Hn)) 



T e Sin). 



(4.6) 



(4.7) 



Since the two group representations commute with each other, the representation space decomposes 
into a direct sum of tensor products of irreducible representations. It turns out that the irreducible 
representations of SU{d) and S{n) are indexed by Young diagrams with d rows for the former and n 
boxes for the latter. A Young diagram is defined by a tuple of ordered integers A = (Ai > A2 • • • > Afe) 
with \i the number of boxes on row i (see Figure[T|). As we shall see later this pictorial representation 



Figure 1: Young diagram with A = (5, 3, 3, 2). 

will be very useful in understanding the structure of the irreducible representations (TYajTta) of 
SU{d). 
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The following theorem called Schur- Weyl duality shows that the only tensor products appearing in 
the above mentioned direct sum are those of irreducible representations indexed by the same A, and 
in particular the algebras generated by 7r„(w) and respectively TTdir) are each other's commutant! 

Theorem 4.1. Let 7r„ and Tr^ be the representations of SU{d) and respectively S{n) on (C'')'*". 
Then the representation space decomposes into a direct sum of tensor products of irreducible repre- 
sentations of SU{d) and S{n) indexed by Young diagrams with d lines and n boxes: 

A 

TTn = 07rA«ll/C;,, 
A 

X 

In particular p^'" — pf?^ and TTdir) commute for all r. Hence we have the block diagonal form for 
the joint states 

^«^n^0p«VA"®^, (4.8) 

where M„(A) is the dimension of IC\, p^^ is a probability distribution over the Young diagrams, 
and p_^'" is a density matrix on Tix. From (|4.4p and the Schur- Weyl duality, we get the expression 
of the block states 

pt'" = C/a(C7V^)Pa'°'" C^a(C7V^)*- (4.9) 

We interpret the decomposition (|4.8p as follows: by doing a 'which block' measurement we obtain 
information about 9 through the probability density p^'"- ^^ ^^"-^ ^^ i^ easy to see that p^^^ does not 
depend on ^, so it only gives information about the diagonal parameters u. Later on we shall see 
that the model p^'" has the same limit as the classical multinomial model described in Example 13.51 
Once this information has been obtained, one still possesses a conditional quantum state p^f^ . It 
turns out that this state carries information about the rotation parameters C, and we will show that 
the statistical model described by the conditional state converges to a 'purely quantum' Gaussian 
shift experiment. 

4.2 Displaced thermal equilibrium states of a harmonic oscillator 

The ground state of a quantum harmonic oscillator or the laser state of a monochromatic light pulse 
are well known examples of quantum Gaussian states. Both physical systems are described by the 
same algebra of observables generated by the canonical 'position' and 'momentum' observables Q 
and P satisfying the Heisenberg commutation relation 

QP-PQ = il. (4.10) 

These observables can be represented on the Hilbert space L^(M) as 

{Ci4'){x)^x4,{x), (P^)(x)=-z^(x), ^eL\R). (4.11) 

dx 
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The space L^(M) has a special orthonormal basis {|0) , |1) , . . . } with the vector \m) given by 

i/„(x)e-^'/V(V^2"m!)i/2, 

where Hm are the Hermite polynomials. These are the eigenvectors of the number operator N := 
i(Q^ + P^ — 1) counting the number of 'excitations' of the oscillator or the number of photons in 
the case of the light beam, such that N |?7i) = m I'm). 

The creation and annihilation operators 

a* = (Q - iP)/\/2, a=(Q + iP)/\/2 

satisfy [a, a*] = 1 and act as 'ladder' operators on the basis tpk'- 



a \m) — \frn |to — 1) , a* \rfi) = ym+l |?7i + 1) . 

In particular the following identity holds: N = a*a. 

It can be easily checked that both Q and P have Gaussian distribution with respect to the vacuum 
state |0). In fact they are 'jointly Gaussian' 

(0| cxp(mQ + ivP)\ |0) = exp (-j{u^ + v^[ 
We will often use the complex form of the unitary Weyl operators 

W{z) := exp(za* — za) — cxp{ipoQ - iqoP), z ^ {qo + ipo)/V2 e C, 

which satisfy the Weyl relations 

W{zyW{z')W{z) = exp (2ilm(z'z)) W{z'). 
The coherent (vector) states \z) are obtained by displacing the vacuum state with Weyl operators 

\z) := Wiz) |0) = exp(-|z|V2) f] ^ |m) . (4.12) 

They are Gaussian states with the same variance as the vacuum, and means (z| Q \z) = v2Re(z) 
and (z|P|z) == V2Im(z): 



{z\ W{z') \z) = exp [-\\z - zT + 2iluY{z'z)\ 



Besides, coherent states, an important role in our discussion will be played by the thermal equilibrium 
states. For every /3 > we define the Gaussian state 

Its density matrix consisting of a mixture of A;-photon states with geometrical weights 

CO 

*;=0 
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and can also be obtained by 'smearing' the coherent states with a Gaussian kernel: 

^p = "—^ f exp {-{e^ - l)|z|2) \z) {z\ dz. (4.15) 

The thermal equilibrium states can be shifted in 'phase space' by means of displacement operations 
D^ which act by adjoining with unitaries W{z), i.e. 

£»"(•) := Ad[W^(z)](-) = Wiz)* ■ Wiz). 

The result is a Gaussian state (j)% with the same variance as (j)fj and the same means as \z): 

^^W{z')):=exp(^-j^^^^ + 2tlm{z'z)y <py.= D^^ p) -.^ W {z)* ^ pW (z) . (4.16) 

4.3 The multimode Fock space and the hmit Gaussian shift experiment 

We now consider d{d — l)/2 commuting harmonic oscillators, with a joint state consisting of inde- 
pendent Gaussian states. Let us define the multimode Fock space 

T:= (g) L\R), 

l<j<k<d 

in which we identify the number basis 

|m) = (g) |mj- fe> , m = {m,-feeN: j<fc}. (4.17) 

j<k 

For each of the oscillators we define the thermal equilibrium state 

^j,fc - *ft.., P3,k = Hf^j/f^k), (4.18) 

where {/xi, . . . , ^d} are the eigenvalues of the density matrix po (cf. (14. ip ). We now use the Weyl 
operators to displace these states by an amount proportional to the off-diagonal elements Cj.fe of p^ 
(cf. g21) and glD) 

$C.,. .^ ^ ( Q.k \ * ^ . ^ f — 0,^ 



J'*= \2^pj-pkJ \2^p.j-pk 

We now define the joint state 0'' of the oscillators with density matrix 



'^^ = <^'^fk eTi{T), (4.19) 



j<k 
where Ti{T) is the space of trace-class operators on T . 

The states $^ form the quantum part of the limit Gaussian experiment. The classical part is identical 
to the {d— l)-dimensional Gaussian shift model N{u, V{p)) of Example 13. 51 where /i = {/ii, . . . , pd}- 

Definition 4.2. On the algebra L°"{W^^^) ® B{!F) we define normal state (f)^ with density 

$® ■.= N{u,V{p))®<^^ eL^{«^-^)®Ti{T), (4.20) 

where Af{u,V{p)) is the Gaussian density of Example \3.5[ The quantum-classical Gaussian experi- 
ment TZ is defined by 

7^ = {$« : 61 = (u, C) e K'^-i X C-^id-i)/^}, 
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4.4 The main theorem 

We are now ready to formulate the main result of the paper. In view of subsequent application to 
optimal state estimation, it is essential to consider (slowly) growing domains of the local parameters. 
For given /3, 7 > we define 



e„.0,^ = {(C,K):||Cl|oo<n^|lu|L<n^} 



Recall that 6 is the separation between the eigenvalues of po given by equation (|4.ip . Though we use 
parametrisation (|4.4p for density matrices pe, recall that in the first order this is approximated by pe 
defined in (|4.2p . In fact it can be shown that the same theorem holds for the latter parametrisation. 

Theorem 4.3. Let S > 0, let (3 < 1/9 and 7 < 1/4. Let the quantum experiments 

where p^^" = pfJVj is the state on M ((C'')'*") given by equation (|i^ . and $^ is given by ((i?^ . 
Then, there exist channels (completely positive, normalised maps) 

r„ : M(C'')®"^ii(M''-i)®Ti(j^) (4.21) 

Sn ■■ i^(R'*"^)®ri(J^)-> Af(C'*)®" (4.22) 

with Ti{J-) is the space of trace-class operators on T , such that 

sup \\<i>'~T,,{p'^")\\^=0{n'yS), (4.23) 

sup \\Sn{^'>)~p'^"\\=0{n-yS), (4.24) 

where e > depends only on S, j3 and 7. Ln particular we have 

lim A(Q„,7^„)-0, 

n — ^00 

where A(-, •) is the Le Cam distance defined in (12. 5p . 

In other words, we get polynomial speed of convergence of the approximation, which is enough to 
build two-step evaluation strategies in the finite experiments globally asymptotically equivalent to 
strategies in the limit experiment [17] . 

4.5 The relation between LAN and CLT 

One way to think of local asymptotic normality is the following: we would like to understand 
the asymptotic behaviour of the collective (fluctuation) observables (|4.27p with respect to a whole 
neighborhood of the state p, how the limit distribution changes as we change the reference state p**". 

The quantum Central Limit Theorem describes the asymptotic behaviour of the same observables 
with respect to a fixed state, and is one of the ingredients in the proof of a different version of LAN 
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based on weak convergence |16| . However, in the case of strong convergence, which is the object of 
this paper, CLT does not play any role since we are interested in convergence in norm rather than 
in distribution, and uniformly over a range of parameters. 

The purpose of the section is to derive a 'coordinate free' version of the limit Gaussian experiment 
using the Central Limit Theorem and the notion of symmetric logarithmic derivative. The reader 
interested in the proof of main theorem can skip the following pages and continue with section [5j 

4.5.1 Quantum Central Limit Theorem 

Let p be a fixed faithful state on M{C^). To p we associate an algebra of canonical commutation 
relations carrying a Gaussian state <j). The Quantum Central Limit Theorem [37] says that (j> is the 
limit distribution of certain multi-particle observables with respect to of product states p®". 

Let 

{A,B)p:^TiipAoB), where AoB:= , 

be a positive inner product on the real linear space of sclfadjoint operators M{C^)sa- We define the 
Hilbert space with inner product (•, •)p. 

L^p) = {Ae M{C'')sa ■■ Tr(Ap) = 0}. 

Let a be the symplectic form on L^ [p) 

a(AS) = ^Tr(p[A,i?]). 

The C*-algebra of canonical commutation relations CCR{L^{p), a) is generated by the Weyl oper- 
ators W^(A) satisfying the relations 

W{A)* ^W{-A), W{A)W{B) = W{A + B)eyiY>{-ia{A,B)), A,B e L^{p). 

On CCR{L^{p),a) we define the Gaussian (quasifree) state 

HWiA)) := exp (-^PH^) , \\A\\l = iA,A),. (4.25) 

The state (f> is regular, i.e. there exists a representation (7r,7i) of the algebra CCR{L^{p),a) such 
that the one parameter family t i-^ TT{W{tA)) is weakly continuous and is a normal state on the 
von Neumann algebra generated by tt{CCR{L^{p), a)). This means that there exist selfadjoint 'field 
operators' B{A) such that TT{W{tA)) = exp{itB{A)), and there exists a density matrix <i>^ e '?i('W) 
such that 

q^iWiA)) = Tr (exp(^B(A))$,) , A G L^{p). 

The representation (tt, H) can be obtained through the GNS construction, or by 'diagonalising' the 
CCR algebra as we will see in a moment. From (|4.25p we deduce that the distribution of B{A) with 
respect to is a centred normal distribution with variance ||^||p. From the Weyl relations it follows 
that the fields satisfy the following canonical commutation relations 

[B{A),B{C)] = 2ia{A, C)l, A,C e L^{p). 
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Consider now the tensor product {S)fc=i M{C'^) which is generated by elements of the form 

A^*^) = l(g)...(g)A(g)---(g)l, (4.26) 

with A acting on the fc-th position of the tensor product. We are interested in the asymptotics as 
n — + CX3 of the joint distribution under the state p®", of 'fluctuation' elements of the form 

n 

F„(A):=^VaW. (4.27) 

Theorem 4.4. [Quantum CLT] Let Ai, . . . ,As £ L'^{p). Then the following holds 
Jim Tr (^p^- (n^"(^'))) = ^ (n (BiAO)^ , 
lim Tr (p«" [f[exp{tF,Mi))] ] ^^\f[W{Ai)] . 



\i=i / / \/=i 



Although the algebra CCR{L^{p), a) may look rather abstract, its structure can be easily understood 
by 'diagonalising' it. Let us assume that p is a diagonal matrix po — Diag(/J,i, . . . ,fJ.d)- The Hilbert 
space L'^ipo) decomposes as direct sum of orthogonal subspaces Tipg © H^^ where 

Hp,, := Lm{A : [A, po] = 0, Tr(Apo) - 0}, and 7i^, = Lin{r_,- fc, j ^ k}, (4.28) 

with Tj^k the generators of the su(d) algebra defined in (|7.2p . 



The elements VF(A) with A g TCpg generate the center of the algebra which is isomorphic to the 
algebra of bounded continuous functions Cb(R'^~^). Explicitly, we identify the coordinates in M.'^~^ 
with the basis {di — —fil + Ei^i : z — 1, . . . cf — 1} of Tipg, (see (|7.2p for the definition of Ei^i). Then 
the covariance matrix for the basis vectors is 

{di,dj)p„ == Tr(porfirfj) = 5i^jPi - /i^/^j = [V{p)\rj, 

where V^ is the covariance matrix (j3.3p . 

Moreover 

<j,fc := T,,k/^j2ifij-pk), J ^ k, (4.29) 

form an orthogonal and symplectic basis of Ti.^^^ , i.e. 

<^(tj,k:tk,j) = -^/2, j < k, a.nd(T{tj^k,ti,7n) ^0 ior {j,k} ^ {l,m}. 

which means that {tj.fe, tkj} generate isomorphic algebras of quantum harmonic oscillator which we 
denote by CCR{C). From 

l|ij,fe||po=Tr(potL) 



^.,.,U-Tr(pot,„)-2(M,-M.) 



and (|4.13p we conclude that each of the oscillators is prepared independently in the thermal equi- 
librium state (jjj^k = <t>i3j,k '^ith Pj^k = ^'^{fJ'il Pk)- 
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Based on the discussion of sections 14.21 and 14.31 we can choose Ti :— Lp'iW^^^) ® T and define 
the regular representation tt of CCR{L^{po),<j) on this space in a straightforward way and its von 
Neumann completion is i°°(M''^^) (g) B{J-). The state decomposes as 

^^iV(O,y^)®(g)0,,fc. (4.30) 

j<k 

which is precisely the state (j)^ for 9 = {u, () ~ (0, 0), defined in (|4.20p . 

4.5.2 The quantum Gaussian shift experiment through Fisher information 

We complete the family of states (p^ of the experiment TZ by shifting 0° with the help of symmetric 
logarithmic derivatives. As in the classical case, this will be a family of Gaussian states with the 
same covariance, and mean proportional to the local parameter 6. The covariance is related to the 
Fisher information matrix as described in Remark 13.41 Thus we will start by defining the quantum 
analogues of the score functions and the Fisher information matrix for the full quantum model pg. 

Let us define the symmetric logarithm,ic derivatives [271128] as the solutions in L'^{pq) of 



(re) dpe 

dReCj,k 



C)J opa = 



^(»m) dpe 



^P<^ 



?=o 



Then with Hj,k, Ei^^ defined in ([7\2|) 

^j.k = Hk^j/{pj + pk), >Cv™' = Hj,k/{pj + Pk), ii = Ei^i/pi - Ed,d/Pd, 

and the quantum Fisher information matrix consists of a 'classical block' that coincides with that 
of the classical multinomial model in (13.21) 



[Ipoh ■= i^tJ])pa = [iMh, l<ij <d- 1, 

and a 'purely quantum' block given by the diagonal matrix 

Hp„ - Diag {WCjMllo^ IIAj||p„ ■■j<k)^ Diag {{p, + pk)-\ {p, + Pk)-' ■ ] < k) . 
Lemma 4.5. Let 

j<k i 

Consider the representation (7r,7i) of CCR{L^{pQ),a) and the normal state (j) on L°° {M.'^^^) (E) B{T) 
as defined in the previous section (cf. \4.30^. Let cj)^ be the state defined by 



/(W(A)):=exp(--P,A||p„+*(A/:W)po), AeL\po). (4.31) 



Then (jf is normal with respect to the representation [tt^TC] and coincides with (ff (cf. (|4.20p ). 

Remark 4.6. The expression (|4.3ip is clearly the quantum analogue of the characteristic function 
of the classical Gaussian shift experiment (|3.H) . Note in particular that the distribution of B{A) 
with respect to (pe is the normal with variance ||^||L centred at (A,£(0))pp. 
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Proof. From (|4.28p - (|4.3ip , and by expressing A in the symplectic basis (I4.29P 

^ = ^ {U],ktj,k + VjMtk.j) + ^ Wiii, 
j<k i 



we get 



2 N fJ'j + Mfc 



\A\\l, - -%.- + EK. + -f,.)2(^^^> (4-32) 



(Amu = ^"7,„. + ^ "^-^-^^^"-^■^-^^^-^^ (4.33) 



which imphes that the foUowing decomposition holds 

/ - 7V(/p„u, /pj ® (g) </>5;,'" := 7V(/p„w, /pj $5 ^^^ (4.34) 

j<k 

where we used the fohowing expression for the displaces thermal equilibrium states (j)-^j^ = 0^ defined 
in (|4.16p . with /? — lufij/^k, z — Cj,fc 

'■' ^ y ^\^ ^ '4(Ai,-/i/c) v/2(mj - Mfe) ; 

n 

5 Explicit form of the channels and first steps of the proof 

5.1 Second look at the irreducible representations of SU{d) 

Before explaining the steps involved in the proof, let us take a closer look at the block states (|4.9p . 
Recall that we have the decomposition of Theorem 14.11 over Young diagrams with n boxes and 



®A'-^T^ 



B.n _ n\ J,n ^ ^K 



^ Mn{\) 

Let {/i, . . . , fci\ be the eigenvectors of poi i-e. the standard basis vectors of C*. Then the eigenvectors 
of pf"" — p°^" are tensor products 

/a := Ia(l) ® /q(2) «)•••«) fa(n), 

and the eigenvalues J^j. Xatk) do not depend on the order of the vectors in the product. 

5.1.1 Projecting onto a copy oi Tix. 

Our aim is to 'project' to an irreducible representation Tix and obtain an explicit expression for the 
eigenvectors of the block components p^'". Such a projection is not unique, in fact for any rank 
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Figure 2: Left: a standard Young tableaux. Right: a semi-standard Young tableau for d = 3 

one operator \v){u\ £ B{K.\) with {u\v) = 1 we can define a (not necessarily orthogonal) projection 
y = y^ on a copy of H\ 

y^{u, v) := In, ® \v){u\ : (C^)®" ^ 7^^ ® b). 

However the action of y\(u,v) on basis vectors /a depends on a particular identification between 
(C^)®" and the direct sum in Theorem 14.11 Therefore we need a direct way of defining such a 
projection and the key observation is that yx{u, v) is a minimal projection in the algebra Alg(7fd(T) : 
T € S{n)), i.e. it cannot be decomposed into a sum of non-zero projections, and vice-versa any 
minimal projection is of this form. The following recipe (given without proof) shows how to construct 
minimal projections in the S{n) group algebra. We recall that the group *-algebra A{S{n)) is the 
linear space spanned by the group elements endowed with a product stemming from the group 
product 

a= ^ a{T)T, 6= X! ^(^)^ =^ ab^ ^ a{T)b{g)Tg = ^ ^ a{as^^)b{s) \ a, 

TGS(n) geS{n) T,geS{n) creS{n) \s6S(n) / 

and with adjoint a* — X^reSfn) o-{''')t^^- 

Let A be a Young diagram with n boxes consider the (standard) Young tableau t in which the boxes 
are filled with the numbers {1, . . . , ti} in increasing order from left to right along rows, starting with 
the top row and ending with the bottom row, as shown in the left-side tableau of Figure [H 

Define the group algebra elements 

where TZ\ is the S'(n)-subgroup of permutation leaving the rows of i invariant, and C\ is the subgroup 
of permutations leaving the columns of t invariant. Note that Pa and Qx are self-adjoint elements 
of the S{n) group algebra satisfying 

d d 

PxPx = \nx\Px = {Y[Kl)Px, QxQx = \C{X}\Qx = (l[t^'-^'+')Qx- (5.1) 

The Young symmetriser is defined as 

Yx := QxPx- 

Theorem 5.1. Up to a scalar normalising factor, the Young symmetriser Yx is minimal projection 
in A{S{n)) and yx := qxPx = t^ d{Q x)tt d{Px) projects onto a copy of Hx C (C'')®". 

The action of the Young symmetriser yx on basis vectors /a E (c^)®" follows easily from the 
definition of Yx- For each /a we fill the boxes of A with the indices a{k) going along rows from left 
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to right, starting with the top row and finishing with the bottom one. For example, if A = | | | I 
and fa — f2®f2®fi®l2® /i then ia = ^^- S{n) has an obvious action on the set of tableaux 
by permuting the content of the boxes which arc numbered from 1 to n in the standard way as in 
Figure [21 The action of the Young symmetriser yx = q\p\ on /a is deduced from the action on the 
tableau ia : one first symmetrises with respect to components which are in the same row, and then 
antisymmetrises with respect to components in the same column. For example if A = ^^ then 

2/a(/2 ® /i «) /a) = h® h<^ h + h® h® h- h® h® h- h® h® h- 



5.1.2 Finding a basis in Tix 

By the previous Theorem the vectors yxfa span TYa, but are not linearly independent. We show now 
how to select a basis ( subset of linearly independent vectors spanning TLx). A semistandard Young 
tableau is a diagram filled with numbers in {1, . . . , d} such that the entries are non-decreasing along 
rows from left to right and increasing along columns from top to bottom, as in the right-side of 
Figure [H 

Theorem 5.2. The vectors yxfa. for which t^ is a semistandard Young tableau form a (non- 
orthogonal) basis of the irreducible representation {■nxi'Hx)- 

Since the values in the rows are nondecreasing, there is a one-to-one correspondence between Young 
tableaux t^ and vectors m = {mij)i<ci<^j<d where rriij is the number of j's appearing in line i of 
the Young tableau t^. Note that we need only consider m^j for j > i, as there is no j in line i if 
j < i (the columns are increasing), and the number of i in line i is Ai — X]7=i+i '^«j- F'or example, 

l | 2|3f3l 



if ia = msl then m = {mi^2 = 1, rni^3 = 2, 7712,3 = !}• 

By a slight abuse of notation we shall denote the corresponding vectors by yxfm and the normalised 
vectors 

|m,A):=AA(m,A)yA/m, (5.2) 

where A/'(ni, A) = l/||yA/m|| ■ This constant is in general not easy to compute but we will describe 
its asymptotic properties in section FTSl 

Using (|5.ip we have 

d 

{y\feL\y\fh) = {q\p\fa\q\p\fh) = {pxfsiklp^fh) = (Y[i^'^^'*'){p\feL\y\fh)- (5.3) 

In order to get further simplifications, we examine some special vector states, that we shall call by 
analogy with the Fock spaces finite- dimensional coherent states. 

The first is the special vector |0, A), the highest weight vector of the representation {nx^Tix), which 
later on will play the role of the finite- dim,ensional vacuum. This vector, as we have seen, corresponds 
to the semi-standard Young tableau where all the entries in row i are i. An immediate consequence 
is that 

d 

PA|/o) = (nA.!)l/o>- (5.4) 
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Moreover (/oI'Za/o) — 1 since any column permutation produces a vector orthogonal to /q. Thus 
the normalised vector is: 



|0,A> 



1 



ntiA.!%A^ 



=yA|/o) 



(5.5) 



The finite-dimensional coherent states are defined as ttx{U)\Ox) for U G SU{d). From [px, irxiU)] = 
and (EH), we get px7rx{U)\0x) = (Jlti ^^^■)U\Ox), thus 

{yxU7rxiU)\0,\) = 



[]z^.-^-+i(pA/m|gA^A(C/)/o) 



(5.6) 



The latter expression holds for any linear combination of /m on the left-hand side, in particular 
7rACl^)/o for another unitary operator V. In Lemma mi we shall examine asymptotics of (|5.6p for 
specific sequences of unitaries U when n — > oo. One of the main tools will be formula (|7.6p . 

The following expressions of the dimensions of /Ca and Tix are given without proof. 

Let gi^m be the hook length of the box (I, m), defined as one plus the number of boxes under plus the 

number of boxes to the right. For example the diagram (5, 3, 3) has the hook lengths 
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The dimension Af„(A) oi ICx is 



M„(A) = 



n l=l...d 9l,m 

771— 1... A/ 



and can be rewritten in the following form which is more adapted to our needs: 



M„(A) = 



Xi, . . . ,Xd 



n 



1 = 1. ..d 
k=l + l...c 



Xi — Xk + k ~ I 
Xi+k-l 



The dimension I?(A) of Hx is: 



2?(A)= n 



i=l...d 
j=l...Xi 



j + d-i 



(5.7) 



(5.8) 



To summarise, we have defined a non-orthonormal basis {|m. A)} of Hx such that |m. A) are eigen- 
vectors of p*^'"'"- for all A, with eigenvalues: 



d d / u,n ' 

(m,A|p«-"'"|m,A)=n(A*f")'' U ""' 



(5.9) 



1=1 



j=i+l \/^i / 

where ^"'" ^ fii + Ui/^ for 1 < i < (d - 1) and /i^'" = fid - iJ2i Ui)/y/n. 

The next step is to take into account the action of the unitary U{Q). We define the automorphism 
of the n-particles algebra 

by 



A'^'" : Af((C'')^") -> Af((C'^)^"), 
A ^ A'^"''"(A) - Ad[U{C,n)]{A) := C/(C/V^)®" AC/*(C/V^)^", 



(5.10) 
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In particular we have p'^^"^"- — A^'"(p°'"'"). By Theorem 14.11 and using the decomposition (14. 8p . we 
get the blockwise action on irreducible components 

A 

where A^'" = K(i^\[C,. n)\. In particular we have 

Px = ^x (Pa )■ (5-11) 

With these notations, we can set about building the channels T„. 

5.2 Description of T„ 

We look for channels 



T„ : Af ((C*)®") ^ L^R"^-^) ® Ti{T) 



of the form: 



T„ : /■" ^ Y.P^'^ ^A ® {v.p'^V^) . (5.12) 

A 

Here, Vx is an isometry from Tix to T, i.e. V^*Va = '^Hx- O"^ the classical side, t^ is a probability 
law on W^~ "^ . We may view t" as a Markov kernel (|2.2p from the set of diagrams A to W^^ ^ . 

The channel T„ can be described by the following sequence of operations. We first performs a 'which 
block' measurement over the irreducible representations and get a result A. Then, on the one hand, 
we apply a classical randomization to A, and on the other hand we apply a channel depending on 
our result A to the conditional state p\. 

The underlying ideas are the following. 

1). The probability distribution p^^'' is essentially a multinomial depending only on u, as it can be 
deduced from (15. 9p and (|5.7p . As we have seen in Example 13.51 this converges (in Le Cam sense) to 
a classical Gaussian shift experiment. Here, in order to obtain the strong norm convergence we need 
to smooth the discrete distribution into a continuous one with respect to the Lebesgue measure. 
We choose a particular smoothing distribution which will insure the uniform L^ convergence to the 
Gaussian model (Lemma 16. ip . 

Definition 5.3. Let t"^ he the probability density on W^^^ defined for all A such that "^Xi — n, by: 

Tl(dx) = Tl(x)dx = dxn^^-^y^iAx^n), (5.13) 

where A\,n = {x e R''^^ : \n^/'^Xi + npLi - Ai| < 1/2, 1 <i <d-\}. We further denote 

"a -Px ^A' 
depending on 6 only through u. 
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2). For the quantum part, we map the 'finite-dimensional vacuum' |0, A) to the Fock space vacuum 
|0), and the basis vectors |m, A) of Hx 'near' the basis vectors |m) of the Fock space T (cf. defi- 
nitions (|5.2p and respectively (|4.17p V Here we need to tackle the problem that {|m, A)} is not an 
orthonormal basis but only becomes so asymptotically. The following lemma provides the isometry 
V\ appearing in (j5.12|) . 

Lemma 5.4. Let rj < 2/9. Suppose that Xi — Ai+i > Sn for all \ < i < d, with the convention 
\d+i = 0. Then for n > no{ri,6,d) there exists an isometry Vx : Tix -^ T such that, y|0, A) = |0) 
and for |m| < n'', 

(m| Vx = , (m, A| 

y^l -t- (Cn)(9"-2)/i2/,5i/3 

where C — C{rj,d) is a particular constant. More precisely, Uq can he taken of the form {C {d) / 6'^)^' '■^^^''^\ 

Proof. See section [721 The main tool is Lemma [731 

D 

For Young diagrams which do not satisfy the assumption of the previous Lemma, the isometry Vx 
can be defined arbitrarily. The reason is that fact that those blocks have vanishing collective weight 
and can be neglected altogether (cf. Lemma [ 



From this operational description we conclude that T„ is a proper channel since r" is a Markov 
kernel and Vx is an isometry. We then want to prove that Ta(p_^'"'") is close to $" and that the 
finite-dimensional operations A^'" have almost the same action as the displacement operators D'' of 
the Fock space, cf. (|4.16p . Finite-dimensional coherent states and formula [4. 151 will be the stepping 
stone to those results. 



6 Main steps of the proof 

6.1 Why Tn does the work 

We shall break (|4.23[) in small manageable pieces. The result and brief explanatory remarks, repeat- 
ing those in the derivation, are given from (|6.3p on. 

We introduce first a few shorthand notations: the restriction of T„ to the block A is 

Tx ■■ Px "^ y\Px ^A : 
so that 



Tn- P ' ^l^Px'^X® Tx{p^' ) = 2^ &A 
A A 

We also define T^ : cj) ^ V^(j)Vx. and note that T*Tx = Id^^. 



30 



We expand (|5.12p as 






Niu, V,) ® / - N{u, V,) - J2 bT U / - E ^a' 






Proving (|4.23p then amounts to proving 



sup 



E^a"«0 



< Cn-'/'^. 



We now use the triangle inequahty to upper bound this norm by a sum of "elementary" terms to be 
treated separately in the following sections. 



V(^, v,)-Y, b'A ® 0?V E ^A " ® (<? 



< 






E|hA'"«(/-/A") 



E|hAi|J|(/-<i 



< 



Since ll/lli = \\U{u,V^)\\i = U'x'W = 1, we have || (/- 0«-") ||^ < 2. Similarly EaII^a"IIi = 1 



because ||6 

( 



A 



1 = Pa"- ^^ split the sum over A in two parts, one for which it is expected that 



e.ni 



is small, and the other on which the sum of all ||6;^' ||i is small. Specifically, define 



the set of typical Young diagrams 



A, 



{A : |A, - nn^\ < n", 1 < i < d}, for a > 1/2, 



(6.1) 



then 






< 



sup 

AeA„ c 



, + 2 E 11^' 



A 111- 



(6.2) 



A^A„ 



The first term corresponds to the convergence of the classical experiment in the Le Cam sense. If 
the second term is small, then on A„^q,, the (purely quantum) family p^'" is near the family cj)''. The 
last term corresponds to the other representations. If it is small, it means that there is concentration 
of Pj,^'" around the representations with shape Xi = nfii. In other words, the only representations 
that matter are those in A„^q,, there is almost no mass on the other representations. 

The hardest term to dominate (notice that the two others are classical) is the second. We transform 
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it until we reach tractable fragments. 



h< 



< 



DH^") - Di\Tx{p°:^n) + [Di - T,Aj"Ti:](r;,(p°'"'") - 0") 



[i?^"-rAAi^"r;](0O) 



<3 



T[ / 0,U,7l\ 



[i?f-TAA^'"Ti:](/) 



where in the last inequality we used the fact that the displacement operators are isometrics. 

Note that the first term does not depend on ^ and the second term is small if the displacement 
operators A^'" and D'' have 'similar action' on an appropriate domain. Using the integral formula 
(|4.15p for gaussian states cfip and the fact that 0°, is a tensor product of such states (cf. (|4.34|) ') we 
bound the second term by 



[^c_r^Ai^"r;](<^°) 



< 



Cd(£i-l)/2 



f{z) [Di'-TxA'x-n]{\z) 



dz 



where 



m = n 



i<j 



Pi - Pj 



exp 



JM-_Jij_ 
Pj 



and \z) {z\ — _D^(|0) (0|) is the multimode coherent state, so 

[i?f-T,Aj"T;](|z) (z1) = [D^^D'-TxA^^-nD%0) (0|). 

Now, / is a probability density, and the norm in the integrand is dominated by two. By splitting 
the integral we obtain 



[D^ -Tx/4:''n]{<jP] 



< 2 



f{z)dz+ sup 



[D'^D^^ -TxAi;''T*^D^]i\0) (0\ 



By adding and subtracting additional terms 



D'^D^ - TxA'^^''T*D'' ^D^+^ - TaA^+^^'T; 



+ Ja^a ^a^-'a^a ^a ^a 
+ TAAj"Af'"T;^ - TxAl;''TlD' 



we deduce that 



[D^^-TxAtrm^ 



< 



[D<+'-TxAi-'^~'-T*]{\0){0\) 



[Ap'"-Ai-"Ar](|0,A)(0,A|) 
[A',"n-nD']{\0){0 
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where the last two terms on the right side have been simpUficd using properties of T;v,r^, A^'". 
Notice that the first and third norms are essentially the same and the three terms are small if the 
action of A^ is mapped into that of the displacement operators D^ . 



Putting all this together, our 'expanded' form for (|4.23|) is 



sup 



T^{p'>n~<i>^®^{n,V^) 



< sup 



AA(u,F,)-^6t'" 



2 sup yj 

sup 

ef2„ ,3 .^ aga„,q 



\bx 111 



3 sup sup 110" -Tx[p^ ) 



sup sup sup 

||z||<nt' een„,0,^ AeA„,o 

sup sup sup 

||z||<n'3een„,^,^ A6A„,„ 

sup sup sup 

||i-||<n'3een„,^,^ AeA„,Q 



[i^C+?_r^Ai+^~"T;](|0)(0|) 



[D^ 



nAl--T:](\0) (01 



j^C+?,n 



Ai'"Ar](|0,A)(0,A|) 



z\\>nl^ 



f{z)dz. 



(6.3) 

(6.4) 
(6.5) 

(6.6) 
(6.7) 
(6.8) 
(6.9) 
(6.10) 



The last Gaussian tail term is less than Cexp(— Jn^^) where C depends only on the dimension d. 
Under the hypothesis n^'' > 2/(5, this can be bounded again by 0{n~^'^). 

The following lemmas provide upper bounds for each of the terms. Before each lemma we remind 
the reader what is the significance of the bound. The proofs are gathered in section [71 

The classical part of the channel is a Markov kernel r (see definition 15. 3p mapping the 'which block' 
distribution p_^'" into the density b.^^ on R"*"^ which is approaches uniformly the gaussian shift 
experiment (|6.4p . Recall that 6_^'" depends only on u and not on C,, so that we have the same 
parameter set for the two classical experiments. 

Lemma 6.1. With the above definitions, for any e, we have 



sup 



AA(u,F,)-^&t'' 



= 



{n-'^^+y5,n-'/^+ys) 



The next lemma deals with (j6.5p by showing concentration around Young diagrams A in the 'typical 
subset' (j6.ip . This allows we to restrict to this set of diagrams in further estimates. 



Lemma 6.2. Let a — 7— l/2>0. Then, with the above definitions we have 



sup Yl l|foA"lli=OU'^%xp(-n2"-V2) , 



with the 0{-) term converging to zero. 
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The term (16.61) shows that when the rotation parameter is zero, the block states P;^'"'" are essentially 
thermal equilibrium states, as one would expect from the quantum Central Limit Theorem 14.41 
However the convergence here is in norm rather than in distribution, and uniform over the various 
parameters. 

Lemma 6.3. With the above definitions, we have 



sup sup 



t,0 _ rA(pp'") = Oin-'/^+''+yS, „(9')-2)/24/^l/6) 



The terms (|6.7p and (|6.8p show that the 'finite dimensional coherent states' obtained by performing 
small rotations on the 'finite-dimensional vacuum' are uniformly close to their infinite dimensional 
counterparts, thus justifying the coherent state terminology. 

Lemma 6.4. Let e > be such that 2(3 + e < ri < 2/9. 
Then, 



sup sup sup sup 



[£)C+? „ TaA^+^-'^'^T^KIO) (0|) = R{n) 



with 



n-i+"+"5-\ n-i+^^r \ n"'^) (6.11) 



For estimating the terms |6'. 7[ \6.8\) . the case when ^ = is sufficient. This more general form is 
useful for the proof of Lemma \6.5[ The unitar 
C/(C,^, n)) the general SU{d) element of ()7.ip 



useful for the vroof of Lemma \6.5[ The unitary operation is defined as Aj^ " :— Ad[U\{C,^,n)] with 



Finally (|6.9p shows that the 'finite-dimensional' displacement operators multiply as the correspond- 
ing displacement operators when acting on the vacuum. 

Lemma 6.5. With the above definitions, under the same hypotheses as in Lemma\6.4\ we have 



sup sup sup 

l|zll<Ti'3 9en„,^,T AeA„_o 

with R{n) given by equation (j6.1ip . 



[Ai+"^"-Ai'"Ar](|0,A>(0,A|) ^^R{n) 



From the last three lemmas, together with the bound on the remainder integral (j6.10p we obtain the 
following lemma which can be plugged into the bound (|6.2p : 



Lemma 6.6. With the above notations under the same hypotheses as in Lemma\6.4\ we have 



sup sup U"^ - 0t^"|| = R{n) + 0{n~^/^+^+yS + „(9'7-2)/24/^l/6) 

S&Q„,i3,j AeA„_Q 



with R{n) given by equation (j6.1ip . 
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Gathering all these results yield the following theorem which provides the bound (|4.23p . 

Theorem 6.7. For any 6 > 0, I > a > 1/2, r/ < 2/9, e > 0, f3 < {i] + e)/2, 7 < 1/4, the sequence 
of channels Tn satisfies 

sup ||T„(p«'") - <^|| , = 0(n-l/2+/3+^/2^-l/2 ^ „-l/4+^/25-l/4 ^ ^^-l/2+a/2+V2^-l/2^ 

^-1/2+3^2^-1/2 ^ „-/3/2 ^ ^-l/2+7+'Y5 + „(9r,-2)/24/^l/6) (g^^2) 

With any explicit a, (3,^,5, we get an explicit polynomial rate. 

6.2 Definition of S'„ and proof of its efficiency 

The channel 5„ is essentially the inverse of T„ and as we shall see, (I4.24p can be deduced from (|4.23p . 

On the classical side we need a Markov kernel completing the equivalence between the family p^'" 
and A/'(u, V^). Let cr" be defined by 

a" : a; e R^^-^ ^ 5x^ (6.13) 

where A^; is the Young diagram such that ^j^ A^ = n, and \n^''^Xi + njii — Ai| < 1/2, for 2 < i < d. 
No such diagram exists, we set A^; to any admissible value, for example (n, 0, . . . , 0). Notice that 
with (|5.13p . a"" o r" o a"" — cr". Moreover any probability on the A such that J2i ^i = " is in the 
image of u", so that cr" o r"(p^^") = p^'". 

Lemma 6.8. With the above definitions, for any e, we have 

sup ||ct"AA(u,1;J -/^"ll =< CO (n-^/'^+yS,n-^/*+-'/s) . 
\\u\\<m ^ ' 

Proof. See end of section 17.51 

□ 

The channel S'„ is given by the following sequence of operations acting on the two spaces of the 
product L^(M''^^) (E) Ti{!F). Given a sample from the probability distribution N{u, V^), we use the 
Markov kernel a""^ to produce a Young diagram A. Conditional on A we send the quantum part 
through the channel 

with 

Sx:c^^ r;0+ (1 - Tr(r;(0)))|O, A)(0, A|. 

The second term is rather arbitrary and insures that 5a is trace preserving map. What is important 
is that for any density operator p\ on the block A, the operator S\ reverts the action of Ty. 

SxT^ipx) = nr^ipx) + (1 - Tr(r;TA(pA)))|0,A)(0, A| 
= PA + (l-Tr(pA))|0,A)(0,A| 

= Pa- 



ss 



Now 



SniM{u, V^) ® /) = 0[(T"AA(7l, V^)]iX)Sx{<P^) 



Mn{X)' 

and with the notation cr"7Vf := [cr"7V(M, V^))](A) and gf'" := min(cr"7Vf ,pj") we have 



(A) 



Taking L^ norms, and using that all i/i's and p's have trace 1 and that channels (such as S\) are 
trace preserving, we get the bound: 

A A 

<2 Y. 9r+ sup ^A(/)-pt'" +||^"AA(u,Fj-/'"||^ 



A^A„ 



<2 ^ ^r 



A^A„ 



AGA„,o 



sup 

AeA„,Q 



'^^-rA(p''") +\\a^W{u,V,)-p^-%. 



Now the first term is smaller than the remainder term of the gaussian outside a ball whose radius 
is n". Hence this term is going to zero faster than any polynomial, independently on S and u for 
\\u\\ < nT' ■ The second term is treated in Lemma 16.61 (recalling that (/i_>^'" — T\{p^^^)), and the third 
term is treated in Lemma [ 



D 



This ends the proof of (g^l)- 



7 Technical proofs 



7.1 Combinatorial and representation theoretical tools 

Here we continue the analysis of the SU{d) irreducible representations (tt^ , Ti.\) started in section [5TT1 
The purpose of this section is to provide good estimates of quantities of the type (m, A | tt\{U) \ 1, A) 
which will be needed in the proofs of Lemmas 17.31 and 16.41 



We shall use the following form of a general SU{d) element and the shorthand notations 



uiCO 



exp 



l<j<*;<d 



Re(Cj:fe)2"j,fc + lMC].k)Tkj 



i=i ^<-i^k<ri VMi Aife 

U{C,^,n) := U{C/V^,i/V^), C/(C):=C/(C,0), C/(C,n) :- t/(C7%^) 



(7.1) 
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where Hi and T^j are the generators of SU{d) defined by 

Hj = Ejj - Ej+i,.j+i for j <d-l\ 

Tj,k = iEj.k - iEkj for 1 < j < fc < d; 
Tk,j = Ej,k+Ek^j iorl<j<k<d. (7.2) 

with Ei_j the matrix with entry (i,j) equal to 1, and aU others equal to 0. 

We first introduce some new notations and remind the reader about the already existing ones. 

1) We write l{c) for the length of the column c in the Young diagram A. There are then Ai — A^-j-i 
columns such that 1(c) = i. An alternative definition is 1(c) — mf{i : A; > c}. 

2) Recall that we denote by /a the basis vectors /afi) (g) • • • (g) /a(„), and to each vector we associate a 
Young tableau ta where the indices a{i) fill the boxes of a diagram A in a particular way. We denote 
by ta the column c of ta, i.e. the function t^ : {!,..., 1(c)} -^ {1, . . . , d} that associates to the row 
number r the value of the entry of that Young tableau in column c, row r. For example, if ta = \ l \ l \ ^ ^ 
we get the values: 

ti(l) = 2, ti(2) = 2, 4(1) = 2, tl(2) = l, tl(l)^l. 

We shall often be interested in the image ia({lj ■ • • i K'^)}) ^^ unordered set, or compare f^ to W, 
the identity function on the integers {I, . . . ,l(c)}. 

3) Recall also that 7i\ is spanned by the vectors yxfg, for which t^ is a semistandard Young tableau, 
and y\ = q\p\ is the Young symmetriser (cf. Theorem 15. 2p . If t^ is semistandard then we can use 
the alternative notation /m for /a since a is in one-to-one correspondence with m — {niij : 1 < i < 
j < d}, where rriij is the number of j's in the row i of ta. The normalised vectors are 

|m,A) :=yA/m/||2/A/m|l. 

4) Let 0\('m) be the orbit of /m under the subgroup TZ\ of row permutations. This consists of 
vectors /b which have exactly rriij boxes with j in row i, and the rest are i. In particular, row i has 
no entries smaller than i. Since the action of permutations is transitive, we have 

5) Since we antisymmetrize with q\, we are only interested in the ta (not necessarily semistandard) 
which do not have two equal entries in the same column. Such tableaux ta (or vectors /a) shall be 
called admissible and their set is denoted V. 

6) For any /a G 0\(m) we define 

r(/a) := |m| - #{1 < c < Ai : t^ ^ Id=}, 
and denote by V^(m) the set of vectors /a G 0\(m) f] V with r(/a) = F. Then we have 

OA(m)f|V- U^'^^™)- 

reN 

37 



Note that r(/a) > and is zero if and only if each column f^ is either Id^ or of the form f^ir) = 
j5r=i + rSr^i for some i < l{c) < j. A t^ of this form will be called an {i, j) -substitution. 

The following 'algorithm' shows how to build all the possible /a e V'"(ni), thus enabling us to 
estimate the size of V'"(ni). 

Algorithm 

Let (m, A) be fixed but otherwise arbitrary. In order to generate a particular admissible /a G 0\{m.) 
we need to select the rriij boxes on row i which are filled with j, for all i < j. The rest of the boxes 
are filled automatically with i's. The constraint is that no column should have two boxes filled with 
the same number. 

Generating a diagram can be described intuitively as follows. We start with the 'vacuum' vector 
(tableau) /o := /m=o (row i is filled exclusively with i's), and with a set of |m| bricks containing 
rriij identical bricks labelled {i,j), for each pair i < j. To change the content of a box from i into j 
we place an (i, j)-brick in that box. This procedure is repeated until all bricks have been used, each 
box being modified at most once. 

At this stage each column c may contain several bricks placed in the appropriate boxes, so that its 
configuration is uniquely defined by the set of bricks k which shall be called a column-modifier. For 
example if k = {(■*, j), (/, I)} then the column has entries 

j if fc = i; 
tl{k)^{ I iffc = /; 
k otherwise. 

Note that a column-modifier is not an arbitrary collection of bricks but one that can be used to 
produce a column with different entries. In the previous example, if i < / this means either (j ^ f 
and j,l > 1(c)) or (j =: / and I > 1(c)). The elementary one-brick column-modifier denoted K(i,j) 
can only be used in a column with i < 1(c) < j , otherwise the entry j would appear twice. 

Now, since the length of a column is at most d and all entries must be different, there are less than 
d\ different types of column-modifiers. Another important remark is that a column-modifier always 
increases the value of the modified cells, so that in this case ta({li • • • 7 ^(c)}) 7^ {Ij • • • , ^(c)}- 

Alternatively to the above scenario where the bricks are inserted sequentially, we can first cluster 
them into |m| — F column-modifiers, and then apply each column-modifier to a particular column. A 
given collection of column-modifiers is uniquely determined by {ttik : k} where TOk is the multiplicity 
of K. This procedure is detailed in the following 3 stages: 

I. Choose F bricks among our |m|. As we have d(d — l)/2 different types of bricks (recall that 
i > j), and we do not distinguish between identical bricks, there are at most [d(d — l)/2]'" 
possibilities. For F = 0, we have only one choice. 

II. Consider the remaining bricks as a set of elementary column-modifiers. Starting from these, 
we sequentially add each of the F bricks selected in the first stage, to one of these elementary 
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column-modifiers to form non-elementary ones. At each step we have at most d\ different types 
of column modifiers to which we can attach the new brick. Note that we do not distinguish 
between column modifiers of the same type, but rather consider them as an unordered set. 
Hence, we have less that {dl}^ possibilities. 

Note that at the end of stage II at least max{0, |m| — 2r} of the column- modifiers are elemen- 
tary, and that rUi^uj^ < rriij. 

III. Apply the column- modifiers to the columns of /o, so that no two modifiers are applied to the 
same column and the resulting /a € 0\{iLn) is admissible. By construction r(/a) = F and all 
admissible tableaux can be generated in this way. 

For counting the number of possibilities for the third stage we apply the column modifiers sequen- 
tially, but since some of them may be identical we need to divide by the combinatorial factor J^^ rn^l, 
where m^ is the number of column modifiers of type k. 

We distinguish between elementary column modifiers of type K{i,j) and composite ones. There are 
less than n possibilities of inserting a composite column-modifier k. An elementary one of type 
K,(i,j) can only be inserted in a column with at least i rows, and since the resulting vector has to be 
admissible, the column cannot contain another j, so its length is smaller than j. There are A^ — Aj 
such columns. Hence the number of possibilities at stage three of the algorithm is upper bounded 

by 

K^K{i,j) i<3 '"■''^•' 

When F = 0, for each elementary colmim modifier K(i,j) the number of available columns is at least 
(Xi — Xj — |ni|)-|_ := max{0, Xi — Xj — |m|}. Thus we have the following lower bound 

rrii j] 

Notice that the upper bound (|7.4p depends on the set of multiplicities {m,^}. 

We now return to our list of notations and definitions. 

7) To each column of t^ we associated a column modifier which completely determines its content. If 
m^ is the number of columns with column-modifer k, we collect all multiplicities in E := {m'^ : k}. 
In particular F is a function of E 

F(/a) = |m|-^m:. 

Vectors for which F(/a) = have the same multiplicity set E*^ where Tn^/ij\ — rriij for all i < j and 
the other m^ = 0. Similarly to V'"(m), we denote by V^(m) the set of tableaux in OA(ni) P| V with 
E{fa) = E, in particular 

V^(m)=|JV^(m) 
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8) To each column c of ta we associate two disjoint sets: the added entries {ta(l)i • • • i^a(^(c))} \ 
{1, . . . , l{c)} and the deleted entries {1, . . . , l{c)} \ {t'^{l), . . . , t^(/(c))}. This data is placed into a 
single set by attaching a ± sign to each entry, indicating if it is added or deleted. It is easy to 
verify that if ta is admissible, the set of added and deleted entries is uniquely determined by the 
column-modifer k associated to c, and hence shall be denoted by S{k). For example S{K{i,j)) — 
{{i, — ), (j, +)} and for k = {{i,j), (j, k)} we have S{k) = {{i, — ), (fc, +)}. We define the multiphcities 
rrig = J2k-s(k)=s '^k ^^*i ^(/a) '■— {itt-s ■ S} . To summarise, we have defined the maps 

We now state our estimates. The first point of the following lemma is an exact formula serving as 
the main tool to prove some of the bounds below. 

Lemma 7.1. 

1. For any unitary operator U € Af (C''), for any basis vectors /a and /b, we have 

(/akAC/^"/b> = n det((7*^*^), (7.6) 

l<c<Ai 

where [/*a'*b is the l{c) x l{c) minor of U given by [U^'^'*^]i,j — Ut<: (i) ^t^ (j) ■ 

Under the assumptions 

(7.7) 





|m| <«", 




A e A„^„, 


inf l^i - 


- ^J■^+l\ > s, 




tl-d > S, 




llCli<Cn^ /3<l/2 




mi<n-'/'+'^/5, 




/2\ 1/(1-") 




\S J 



we have the following estimates with remainder terms uniform in the eigenvalues /i, ; 
2. The number of admissible /a G 0\{m.) with r(/a) = is 



#V"(m) = n (^^^4^(1 + Oin-'+'Vd)). (7.8) 

3. Let E := {m^ : n} with T{E) = T. The number of admissible /a G 0\{m) with E{fg) ^ E is 
bounded by: 

#V^(m) < n-r+i:,<,(m.„-™.<,.„) TT (^» ' ^^ )"'"""' . (7.9) 
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4- The number of admissible /a G 0\{ni) with r(/a) ~ T is bounded by: 



#Vr(m) < C^n-^S-^^lmrTl^^' ~ ^'^^^"' 



n 



mi. 



(7.10) 



for a constant C — C{d). 



5. Let /a € V'" (1), and consider V^ (m) C 0\{m) for some fixed T^ . Then: 



, /a 

withC = C{d). 
6- ///aG V°(m), then 



/bevr''(m) / 



< 



i/rV |m|-|i| + r'^ 



(qm|)r 



othe 



(7.11) 



/a 



9A ^ /b ) = 1. 



(7.12) 



'^- If f& G V'^(m) so t/iat its set of elementary column-modifiers is £'" — {mi^uj\ — mij}, then 

IICIiAtt^ c, 



n 



^ ^f<^ V\/^Vm. -Mj 



r(n), 



(./■a|gAC/(C,^,«)^"/o)=cxpU(/. 
wii/i i/ie phase and error factor 

d-l 

r(n) = 1 + (n-i+2^+"r\n-i/2+2/35-i,n-i+2^+"ri) . 

iS- -(f /a G V^(ni), so that its set of column-modifiers is E — {m^ ■ '«} (md T[E) — T, then 
{f.\qxUiC,^,nr^^fo) 



(7.13) 






n(5 / f^-^ V V^VM* - Mj 



r(n) (7.14) 



mt/i C = C(d) a constant and r{n) as in point^ above. 

Under the further hypotheses that \\z\\ < n^ , m.ij < 2\Q,j + Zij\n"'^'^ for some e > 0, we have: 



\/a,eOA(m) 



qxU{C + z,^,n)fo 



exp iq 



\\C + 4l 



z— i ^W 



i<j 



TOi 



(7.15) 



with 



r{n) = 1 + (n-i+2^+"r\n-i+2''+"r\n-i+2"5-\n-i+"+"<5-\r3/2n-i/2+3^+2^) 
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10. Under the further hypotheses that |1| < |ni| and n^ '^'^ > 2C jS^ , where C — C{d), 



\/a60A(I) 



9A Yl •^b 

/beOA(m) / 



<(C|m|)l'"l-l'l[|- 



i<j 



"■J J ■ 



n(5^ 



with 



rS.i„am)> 



(|l-m| + 3|l|-3|m|)_ 



(7.16) 



(7.17) 



ii. FKe have 



^/aGOA(m) 



'^A E /w=n ^'' 'f" (i+o(-^^-v^))- (7-18) 



/beOA(m) 



1<J 



H,]- 



Proof. 

Proof of (17. 6p . We first express {faW^'^fb) as a product of matrix entries of U: 

(/aic/^'7b>= n n (/*£wi^/*&w) 

l<c<Ai l<r</(c) 

= n n ^tl{r).tl(r)- 

l<c<Ai l<r<l(c) 

Since the subgroup of column permutations C\ is the product of the permutation groups of each 
column, each a ^ C\is <j — si . . . sx^ with Sc a permutation of cohuim c which transforms t^{r) into 
tgiscir)). Then 

(/a|<ZAC/^"/b) = (/a|C/^"gA/b) = ^ <'^) IT IT ^^^SM.i&C^e W) 

ctSCa 1<c<Ai l<r</(c) 

= n E ^(^=) n ^tjW,t&(«c(r)) 
l<c<Ai ScSSc l<''<i(c) 

= Yi det(C/*-*^). 

l<c<Ai 



Proof of (|7.8p . The number of admissible /a such that r(/a) = is given by the products of the 
possibilities at each stage of the algorithm. For the first two stages, there is exactly one possibility 
when r = 0. Hence #V" is the number of possibilities at the third stage. Here the upper bound 
(|7.4|) reads as n7>j('^i ~ -^jO^'V^^ij'- O'^ the other hand, we may use (|7.5|) as a lower bound, 
recalHng that A^ - Xj > Sn/2 and |m| < n'' (cf. (frf]) ). This yields the result ([7] 



Proof of (|7.9p . The number of /a in V^ is given by the third stage of the algorithm (the two first 
stages yield a particular E). We then obtain (|7.9p by applying (|7.4p and neglecting the ttt.^! factors, 
while noticing that ^^ r7i„ = |m| — F. 
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Proof of (TTTU)) . The set V^ is the union of all V^ with r(£;) = T. Now the first two stages of the 
algorithm imply that there are at most C^ different E with the latter property, with C — C{d). 

Now we use (|7.9p to upper-bound V^ as follows. Since X]'^K(ij) — l"^l ~ ^F, we may write 
rift "^K(ij)' ^ Y[i<j "i^i.j^- suPi<j "^~ . Moreover A^ — A^ > 5n/2. By putting together we obtain 






Multiplying by the number of possible E yields the result. 

Proof of (|7.1ip . We applying (|7.6p with [/ = 1. Since both /a and /b are product of basis vectors, 
the scalar product (/a | q\fh) is equal to —1 or 1 if t^{[l, l{c)]) — t^{[l, i{c)]) for all columns, and 
otherwise. Here we denote by t^{[l,l{c)]) the set of entries {ia(l)i • • ■ 7*a('(c))}- 

Now, since a modified column cannot satisfy ia([l, i{c)]) = [1, 1(c)] (and the same for b), the vectors 
/a and /b are orthogonal unless they have the same number of modified columns. Finally, that 
number is |1| - F"^ for /a and |m| - F*- for /b. This yields the first fine of (fTTTj) . 

We now concentrate on the case when F'' = |m| — |1| + F". Since |(/a | <ZA/b)| < 1, we can 
bound the sum of scalar products by the number of non-zero inner products. The question is 
how many diagrams /b have the same content (seen as an unordered set) in each column as fa- 
tlillJic)]) = tii[l,l{c)]), or equivalently 5«) = Si^^). 

For building the relevant /b, we can follow the algorithm with the further condition that, at stage 
three, all the column-modifiers are applied in such a way that the unordered column content is 
identical to that of /a. 

The first two stages of the algorithm are the same so they yield a C^ factor. We now have a 
collection {m^} of column modifiers which have to be placed such that they match the column 
content of /a- For each S we identify the column modifiers ki, . . . , K^rs) such that S{Ki) = S for all 
1 < i < r{S). The total number of such objects is ms :— J2i<r(S) "^^i ^^"^ ^^'^ number of ways in 
which they can be inserted to produce distinct diagrams is 

I ms \ 

Recall that the number of elementary column-modifiers J2i<i "^K;(ij) is at least |m| — 2F''. Moreover, 
each elementary column-modifier K{i,j) corresponds to a different S{K{i,j)) — {(i, — ), (j, -I-)}. Thus 



|m| - 2F'' < ^ m,(,j) < ^ ^ max ^to« 
Since 



,. ■.S(k) = S 
i<] b 



'Y^ms^^vi,^^ |m| - F^ 
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we obtain 



This implies 



^r^-.^.T^^S-'^' 



n "^^ h i-i'^- 

Multiplying by the C^ of the first stages, we get (I7.1ip . 



Proof of (|7.12p . As shown above the only non-zero contributions come from /b G V" C 0\{iLn). 

Since T^ = 0, the constant from the two first stages of the algorithm is 1, 777,5 — ''^i,j — "^K(ij) for 
all S corresponding to an elementary column-modifier, and otherwise. So the combinatorial factor 
is again one: we do not have any choice in our placement of column-modifiers. In other words, the 
only /b such that (/a | qxfh) 7^ is /a. Finally, (/a | qxfa) = 1- 

Proof of (frT3)l . From dLH) we deduce 

(/a|gAt/(C,e,nf"/o>= n det(t/*-"^), U^Uilln). 

l<c<Ai 

We will use the Taylor expansion of the unitary t/(C, S,, n) to estimate the above determinants. 
Entry-wise, for all 1 < i < d on the first line, and all 1 < i < j < d on the second and third lines: 

+Om\'n-^'H-^'\\\M\\n-'5-''\\\ifn-')-, 

U^All^) = -^ /'•' +OiKrn''S-\,\\M\\n-'S-'/'); 
UjAC,Cn) - -^ /"^' +0{\\Cfn-'6-\\\m\\n-'S-'^'). 

If C = O(n^), lllll < n-i/2+2/3/j^ ^^^ p < 1/2, the remainder terms are 0(n-3/2+3/3j-3/2) ^j. ^j^g 
first line and 0{n^^^'^^6~^) for the last two lines. 

Therefore, when our parameters are in this range, we can give precise enough evaluations of the 
determinants. The idea is to find the dominating terms in the expansion of the determinant 



detA = J2Y[4'^)A,a 



CO- 



Note that we can use the above Taylor expansions inside the determinant since the number of terms 
in the product is at most d. 
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Since /a G V^, all t^ are either Id'^, or an {i, j)- substitution. If f^ ~ Id'^, the summands with more 
than two non-diagonal terms are of the same order as the remainder term, so that only the identity 
and the transpositions count in ^^ Y\^ A^ ^.(i). Let I = l{c), then 

^ /rj VT) -^ — ^ //, — //, 

l+l<j<d 



Note that for I = d, we get the usual determinant of U{(^, ^, n) which is 1. 

Consider now the case f^ ^ Id^. Since t%(r) > r for all r, there exists a whole column of [/*a.M 
whose entries are smaller in modulus than 0(||C||/vri^) = 0(n-^/2+/3^-i). j^ particular if i^ is an 
(i, j) -substitution, then the only summand that is of this order comes from the identity. So that 

v{z,j) :=det([/*^M^(C,^:n)) ^ ^ fr" + 0{n-'+'^6-^). (7.19) 

Note that this approximation does not depend on /(c), but only on i and j. 

We now put together the estimated determinants in the product (|7.6p . For each i < j there are niij 
columns of the type (i, j)-substitution. Out of the A; — A;+i columns of length / — l{c) there are 
A; - A(+i - Ri of the type Id^ with < i?; < |m|. 



Hence: 



d d 



{u\qxUic,inrvo) = i[ivii))t'-^'^' n i^i^^j)r"Ei^mr'''- (7-20) 

1 = 1 l<i<j<d 1 = 1 

Now v{l) = 1 + 0{n-^+^l^S-^) and Ri < |m| < n'', so the last product is 1 + 0{n-^+^^+'JS-^). 
Similarly, since A € A„^q, we have A; — A;+i = n{fii — fii+i) + 0(n"), and we can use Lemma 17.21 
given at the end of this section to estimate the first product as follows 

^ r , _ .._ ^ 

r{n) 



1=1 I l<i<l 



1=1 



2 1^, "''' /^^-^^ 



CI 



with 



exp i(j) -— r(n) 



0i = Si^dVn{fJ,i - m+i)£,i, 



Vn'^im - iJ'i+i)£,i 



1=1 



We now turn our attention to the middle product on the right side of (|7.20p 
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where we have used that |m| < n^. 

Inserting into (fT^ yields ((71^ . 

Note that {fa\q\U{(, £,, n)^"-fo) = if there exist i < j such that Qj — and rriij ^ . 

Proof of ([Till) . We may write, much hke in fL2U\i . 

d 

(/a|<ZAf/(C, f, nr-fo) = n {V{l))f'~^''' n (^('*))'"~ 11(^(0)" 



N-^1 — -^I 
i = l K 1 = 1 



where < i?; < |m| — T and v{k) is the determinant of the minor of U corresponding to having 
apphed the column-modifier k. We can further split the column-modifers into elementary ones K(i, j) 
and non-elementary ones k' . 

Then {feL\q\U{C,£,,n)'^"fo) can be written as 

1=1 i<j 1=1 k' 

The first three products on the right side can be treated as above. For the fourth product we give 
a rough upper bound based on the following observation. If the entries in the column have been 
modified in an admissible way, then t'^{i) — j > l{c) for some i, so that |w(k)| < C||(^||/vn^ for any 
K, with some constant C = C{d). 

Thus by using the previous point 



(/akA;7(C,e,«)^"/0> 



< 



-f-#U^)""'"nf:,a^)' ^N^ <'-) 



2 / V %/^ / f^- \ V^y^jM^^ 



We obtain (|7.14p by noting that the number of non-elementary modifiers is 



i<j 



Proof of ()7.15p . Note that only admissible vectors in C'A(ni) can bring non-zero contributions. We 
shall split the sum into sub-sums using Ox{m.) Pi "^^ = U_e "^"^(111), and compare each sub-sum against 
the benchmark V*^ = V^° . 

From the bounds on ( and z we obtain ||C + z|| = 0{n'^), so we can apply the previous points with 
( + z instead of C. 



Using (|7.8|) and (|7.13|) and recalling that A e An,a, we get: 



\Uevo 



qxU{C + z,(,n)'^ /o ) =exp 10 [[ ■ r{n) 

I \ ^ ) ^<J '^''^■ 

46 



with error factor 



For E ^ E'^ we combine (fTTil and (frojl to obtain 






qxU{C + z,i,n)fQ 






qxU{C,+z,£„n)fQ 



< n 



n 



ni^.(j tI— mv 



A,-AA"'»"-'- 



mi 



.1 



IK + ^11 



-r 






n 



^ICij +^ijl 



m.wv „-\— rnv 






IIC + ^llv^VMi-Mi, 



r(n) 



< 0(n-r(i/2+/3))rr/2 -Q 
with O(-) uniform in F. In the second inequahty we used 



E(™«(^J) ~ ™^^j) - ~^^' ^ € A„ 



i<j 



and in the third inequahty we used 



^^i,j — ^\^i,j ' ^i,j 






m.ij||C + ^1 



< 1. 



Furthermore, for a given F, there are at most C^ different E such that T{E) = F, corresponding to 
the possible choices in the first two stages of the algorithm, where C = C{d). Hence, if n is large 
enough such that 2C(5^^/^n^^/^+^^+^^ < 1, we have: 



E /a 

\/aeOA(m) 



qxU{C + z, e, n)foj^Y.\ E /« 9aC/(C + z, I n)/o ) 



1 + 0((5-3/2„-l/2+3^+2e) g^p ^^ _ 



n 

exp U(/) ^ I YY^ -— j '- r2{n) 



mij! 



r[n) 



rriijl 



where the sum over F was bounded using a geometric series and 
This is exactly (fTTSJ) . 
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Proof of (|7.16|) . We choose F" and T^ satisfying the condition F'' — F" = |m| — |1| under which the 
inner products in (|7.1ip are non-zero. By multiplying (|7.10p and (jT.lip . we see that: 




/bevr''(m) / 



„cHrn^^#(«)^" 






(q„|)l™Hi|J|(^ 



It remains to sum up the upper bounds over all relevant pairs (F'',F''). If n^^'^'' > 2C/S'^, the 
dominating term in the sum of bounds is that corresponding to the smallest possible F°. The 
question is, what is the smallest possible value of F" leading to non-zero inner products? 

A necessary condition for /a not to be orthogonal to /b is that for each set 5 of suppressed and 
added values, the two vectors have the same multiplicities m^ ~ nig. 

The following argument provides a lower bound for F(/a) -l-F(/b). The idea is to count the minimum 
number of 'horizontal box shuffling' operations necessary in order to transform a Young tableau 
ta' G 0\{in) into the tableau ta- Since |m| < n^ and Ad > Sn + 0{n°'), the tableau ta' can be chosen 
to have at most one modified box per column (thus F(/a') = 0), and such that each of the modified 
columns of ia are also modified in t^'- We also choose tb' in a similar fashion. 

Now at each step we horizontally move one elementary column modifier n{i,i) of ia' (or ^b') into an 
already modified column, with the aim of constructing ia (or ib)- 

Each such operation increases F(/a') + F(/b') by one. On the other hand the operation has the 
following effect on the rrVg (or rrig ): the multiplicities TO{(j _) (^ ^.-j} and mg,-, decrease by one, and 
TO5(j_|_{(j _)_(j_+)j. increases by one. Here Sq is the signature of the column to which the box («, j) is 
moved. Hence the distance ^^ \m'^ — m^ \ decreases by at most three. Since initially this quantity 
was equal to X^io \^i-j ~ "^jjL ^^ need at least X]i<7 \^i,j ~ '^jjl/^ such operations before reaching 
our goal ml — rrig. This means that F(/a) + F(/b) > |1 — ni|/3. 

Together with r'' - F'* = |m| - |1|, this resuh yields F° > (|1 - m| + 3|1| - 3|m|)/6. Moreover F"^ is 
non-negative. 

Replacing in the above equation yields (|7.16p . 

Proof of (|7.18p . Since 1 = m, equations (|7.8p and (|7.12p prove that the bound (|7.22p is saturated 
when F° = 0, up to the error factor (l + 0(n^^+^''/(5)). Hence the remainder term due to the other 
F consist in a geometric series with factor ( }^} j = 0(n^^"^''/(5^). 

D 

The only part of the proof we have still postponed is the following technical lemma: 
Lemma 7.2. If Xn = 0{n^l'^^^), then 

(l + ^)" = exp(x„)(l + 0(n-^)). 
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Proof. For simplicity we will ignore the dependence on n and write x ~ Xn- 
For any y such that \y\ < 1, for any n G N , we have the Taylor expansion: 

fc=l ^ ^ 

Now (n - k)''/k\ < (2) < n'^/kl for n > k. If fc < n^/^-"/^, then (n - fc)''' = n^{l + Oin-")). If 
k > ni/2-'/2, then nV^! = 0(n(i/2+^/2)fc)^ go that if y = x/n = 0{n-^/^-'), 

(l + x/n)" = (l + 0(n-^)) J2 1J+ H 0(^(1/2+^/2)'= (x/n)'= 

/c=0 ■ fe>ni/2-'!/2 

= (l + 0(n-^))exp(a;)+ ^ (0(n(i/2+^/2)'= - nVfc!)(a;/n)'= 

J.>„l/2^./2 

= (1 + 0(n-^)) exp(a;) + 0{e~"'^""^") 
= (l + 0(n-'))exp(a;), 

as exp(x) > Cexp(— 71^/2^^)) for some constant C > 0. D 

7.2 Proof of Lemma 15.41 and non-orthogonality issues 



Lemma 7.3. Let (m, A) anrf (1, A) be semistandard Young tableaux with diagram A and define 

|m 

If 



|m| := Ej<j ™»j fl"'^ |1 - m| := ^^<^. |;,j - m,j 



for some 1 < i < d, then 



j>i j<i j>i j<i 



(m,A|l,A) =0. 

Otherwise, we derive an upper bound under the following conditions. We assume that Xi — Ai+i > Sn 
for all 1 < i < d — 1 and Xd > Sn, for some S > 0. Furthermore we assume |1| < |m| < n^ for some 
rj < 1/3 and that Cn^^^^ /S'^ < 1 where C = C{d) is a constant. 

Then: 

|(m,A|l,A)| < (c'n)-''(l'"l-l'l)/4(C"n)(9''-2)l— 11/12 ^(|m|-|i|)/2-|m-i|/3(^^ 0(^-1+3^/^))^ (7 23) 

where C — C"(d, ry) and the constant in the remainder term depends only on d. The right side is of 
order less than n'-®''~2)|m-i|/i2 ^^^ converges to zero for rj < 2/9 when n -^ oo. 

Proof. We know that |m, A) is a linear combination of ?i-tensor product vectors in which the basis 
vector fi appears exactly A^ — ^^^^ '^i.i+X^Ki "^j,i times. As two tensor basis vectors are orthogonal 
if they do not have the same number of fi in the decomposition, we get that (m, A|l, A) = if 
Ej>» rn^j + 2^.< . mj- , ^ Y.j>i kj + 'Ej<i ^,^ for any 1 < i < d. 
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In the general case, 



(m,A|l,A) = 



{qxp\fm\q\pxfi) 



\/{q\P\fm\q\P\fin)(q\P\fi\q\P\fi] 



(7.24) 



We use the fact that q\ is a projection, up to a constant factor (cf. ()5.ip . (|5.3|) ). and erase the q\ at 
the left of each scalar product, and we decompose pxfm and pxf\ on orbits as in (|7.3p . Since the 
multiplicity of the elements in the orbits are the same in numerator and denominator, we end up 
with: 



(m,A|l,A) = 



(L/,eOx(m) /akA Yjfi,(iOx{\) ^^^ 



(E/,eO^(m) /a|9A E/^,eO;,(m) f ^') {T. f^,eO x{\) Ml^ E^,eO;,(l) /b' 

We use (|7.18p for the denominator: 



(7.25) 



E /a 

\/«eOA(m) 



q\ X! ^'^ 

/,,eOA(m) 



^/beOA(i) 



q\ X! ■^i'' 

/b'eOA(i) 






and the numerator is bounded as in (|7.16p . Then, under the assumption |m| > |1| we have 



(m,A|l,A)|<(qm|)l™l"l'l 



C\m\- 



S'^7 



i<3 






{l+{0{n'^-'/6))) 



where r™„ = ((|1 - m| + 3|1| - 3|m|)/6) A 0. 
The factorials can be bounded as 



n 



H,]- 



< m 



T.i"l^.J~h.J)+/2 _ |^|(|m-l| + |m|-|l|)/4 



m 



Since |m| < n'' and Cn"^^ ^ /S^ < 1, we have 



C|m|3\^""" /(^^^37,-l\ (|l-H+3|l|-3|m|)/6 



6^7 
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Since Xi — Xj > nS we have 



Yl (A, - A,)(''--™--)/2 < (n5)(|i|-l-l)/2. 

l<i<j<d 

The constant C = C{d) can be replaced by another constant C — C'{d,rf) such that all powers of 
n appear in the form [Cn)'^ . Putting the bounds together we get 

|(m,A|l,A)|<(5(l'"l-|'l)/2-l— i|/3(c'n)-''(l-l-l'l)/4(C'n)(9''-2)l™-i|/i2(i + o(^-i+3'Y5)) 



D 



A consequence of this lemma is the following. 
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Corollary 7.4. Let rj < 2/9 and let (m, A) be such that |m| < n^K Assume as in Lemma \7.3\ that 
Xi — Xi+i > Sn for all 1 < i < d — 1 and Xd > Sn, for some S > 0, and that Cr?^^^^ jS^ < 1 where 
C — C{d)isaconstant. 

Then there exists a constant C" = C"{d,vi) such that 

J2 |(m,A|l,A)|<(C"n)(^''-2)/i2^-i/3_ (726) 

|l|<n'' 

Proof. We break the sum into two parts (|1| < |m| and |1| > |m|), and by triangle inequality it 
suffices to prove the statement under the additional condition |1| < |m|. 

We use (17.231) neglecting the terms containing |m| — |1| in the exponent which are less than 1. Then 
the expression on the left side of (|7.26p is bounded from above by 

2^7V(fc)[(C'n)(9''-2)/i2j-i/3l' 
fe>i 
where N{k) is the number of I's for which |m — 1| = k. 

Since there are d{d — l)/2 pairs 1 < J < j < d, there are at most (fc + l)''('^~i)/2 different choices 
for the values {|?i.j — "T-i.j| : « < j} satisfying ^ j/^j — ^Tiijl = fc. Moreover, there are 2''(''~i)/2 gjgj^ 
choices which fix 1 = {?i.j} completely. Thus Nik) < (2(fc + l))''('^^i)/2 < c'^ for some constant c 
which can be incorporated in the geometric series starting at fc — 1, hence the desired estimate. 

D 

We use this quasi-orthogonality to build an isometry Vx : Ti.\ -^ T which maps the relevant finite- 
dimensional vectors |m, A) 'close' to their Fock counterparts |ni). This is the aim of Lemma WM 

Lemma 7.5. Let A be a contraction (i.e. A* A < 1) from a finite space Ti. to an infinite space tC. 
Then there is an R : Ti —i- IC such that A + R is an isometry and Range(v4) _L Range(_R). 

As a consequence, for any unit vector (j), we have ||i?(/)|p = 1 — ||j40||2. 

Proof. As /C is infinite-dimensional, we may consider a subspace TC of /C, orthogonal to Range(A), 
and the same dimension as 7i, so that we can find an isomorphism / from H to H' . We then take 
R = /Vl - A* A. 

D 
Proof of Lemma\5^ Let A\ : Ti\ -^ !F he defined by 



Then, 



Ax :^ , ^ = y |1)(1,A| 
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where the last inequahty follows from Corollary 17.41 and the following argument. It is enough to 
show that all eigenvalues of ^^^a are smaller than 1. Let J2m ^^ l'^' ^) ^^ ^"^ eigenvector of A\Ax, 
and a the corresponding eigenvalue. Then by the linear independence of |m, A) we get that for each 1 

l + (C„)(9.-2)/i2Mi/3 E aA|m,A)c^ = ac,. 

' "^ ' I |m|<n'J 

If lo is an index for which |ci| is maximum, then by taking absolute values on both sides we obtain 

^ ' ' |m|<n'7 

Now we may apply Lemma 17.51 and find an Ii\ such that A\ + R\ is an isometry, and Range (i?A) -L 
Range(A), so that (m| Rx = 0. We define Vx := Ax + Rx- Then 

{va\Vx = lM{Ax + Rx) 
= {m\Ax 

^ (m| y |1)(1,A| 



(m,A|. 



|l|<n'7 
1 



v/l + (Cn)(9"-2)/12/51/3 

D 

7.3 Proof of Lemma 16.41 on mapping rotations into displacements 

We first recall a few definitions and notations. We denote by D^ the displacement operation (super- 
operator) acting on observables in the multimode Fock space JF as 

D^iWiy)) := Ad[W{z)] {W{y)) = e^'^^^^^ W{z + y), y,ze C'^(rf-i)/2. 

The operation acts as displacement on coherent states, in particular 

Di^+^i\0){0\) = \C + z){C + z\. 

Similarly, on the finite dimensional space (C^) we have the action (cf. (|7.ip ) 

whose restriction to the block A is Aj^''"'" = Ad[Ux{CTS,,n)]. 

The isometric embedding Tx{-) := Vx ■ Vx and its 'adjoint' T^{-) :— Vx ■ Vx satisfy 

TaaP'«''"t;(|o) (o|) = Vx\c + z,Cx){C + z,Cmv;: 

where \( + z, £,, A) := Ux{C + z, ^, n) |0, A) are the 'finite dimensional coherent states'. 
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According to Lemma [5^ the coordinates oi V\\(^ + z, ^, A) in the Fock basis are described by: 



{in\V^\C + z,tX) 



something not important if |m| > n''. 



(7.27) 



Using the relation |||V')(V-'I — IV'')(V''ll|i = 2-^/1 — \{iIj\iIj')\'^ , which holds for unital vectors ipjip', the 
statement of the lemma is equivalent to 

sup sup sup sup 1- {z + C\Vx\C + z,^,X) ^Rinf, (7.28) 

with R{n) the original remainder term. 



We shall prove formula (|7.28[) by decomposing these vectors in the Fock basis, that is 

(C + AVx\C + z, i A) = 5](C + z1m)(m|14|C + z, I A). 



(7.29) 



The estimates are based on the following observations. 

1) The coherent states have significant coefficients (C + z\m) only for 'small' m's, i.e. those in the 
set 

M:={m: m,j < |(C + ^)»jf n% i < j}. (7.30) 

In particular, since 2/3 + e < 77 we have Al C {m : |m| < rf'}. 

2) The coefficients (m| Vx|C + i*, Cj ^ are uniformly close to exp(i(/))(C + z\m) where is a fixed real 
phase, in particular uniformly over m G A^. 

3) If am and h„i are the two sets of coefficients, such that Ylm kmP = Ylm l^mP = 1, then 



y^^ flm^n 



< 1- 



y^ am^n 



meM 






< 2 1- 



y^ Omferi 



mSTM 



The precise statement in point 1) is 



^ |(C + z|m)p<rf 



2 ^ w2„-/3 



(7.31) 



(7.32) 



Indeed, the inner products can be written as a product over the («, j) oscillators and we have the 
bound 



m^M i<j 



Each of the terms in the sum is a tail of Poisson distribution and is bounded by n"'^"'^ if Xij > 1 
and by n~^ if Xij < 1. 



," T 



We turn now to point 2). From the third line of (|7.27p we get 



\/(y\fo\yxfo)\/(y\fm\yxfn 

{pxfni\q\U{C+ z,£,,n)fo) 



ViP^fmkxPxfn 



(l+0(n(^''-2)/125-l/3)) 
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where we have used (I5.3P and (|5.6p . 

We recah that 0\{iLn) is the orbit in (C*)**" of /^ under 7?.a and that we have the decomposition 

f — \^ ^^^ f 

Then, by employing formulas (|7.15[) and (I7.18p . we can write 



#-||C+g|l^/2 -Q (C+ ^)r;" / "(a^» - A^jO V''"^' ^(^^^ 



,<j V"^»,i! V ^i - ^j 



The corresponding remainder term is 

r(n) = 1+0 (n(9"-2)/i2^-i/3^ ^-i+2/3+„^-i^ ^-1/2+3^+2.^-3/2^ ^_l+„+2^^.1^ ^_l+„+,,^_l^ ^_l+3,,^-l^ 

and the phase is: 

d-l 

(/> = V^X!^'^* -M»+i)C»- 

i=l 

Since A S A„^q and the eigenvalues are separated by S we have f "y*,/* ) = 1 + 0(7i"~^+''/(5) 

and the error can be absorbed in r{n). 

In conclusion, for m satisfying (|7.30p . we have: 

{ni\VxUiC + z,^,n)\0,X)=exp{i(b){m\C + ^r{n). 

Inserting this result into (|7.29p . and using (|7.3ip and (|7.32p . we get 

l-\{z + C\VxU{C + z,ln)\0,X)\ = O l-r(n), ^ |(m|C + z)|2 =i?2(n), 
with 

R2in) - O (n(9''-2)/12^-l/3^ ^_i+20+,^_i^ ^_l/2+3/3+2.^-3/2^ ^-l+a+2/3^-1^ 

Through expression (|7.28p . noticing that R2{n) = R{n)^, we see that we have proved the lemma. 

D 

7.4 Proof of Lemma 16.21 on typical Young diagrams 

Recall that the state p^'" :— pf?^ has the decomposition over 'blocks' A given by (|4.8p . The 
probability distribution over Young diagrams p^'"'" depends only on the diagonal parameters u and 
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is given by 



U.71 ^ 



pt" = c^ E n(A^r)'' n ( ^„ 



me A 4=1 j=i+l \^^i 

with 

x^f n \^ Ximt=i+ii^i-^k + k-l) 

""" \XuX2,...,xJl\ {Xi+d-iy. 

The above formula can be understood as follows. By invariance under rotations we can take C = and 
the state is diagonal in the standard basis basis (C^) formed by the vector /a. Each eigenprojector 
carries a weight 11^=1 (/^"'")™' where rrii is the multiplicity of the vector fi in the tensor product 
/a- Thus, we only need to add all multiplicities over vectors that are 'inside' the block A. Since the 
irreducible representation has basis /m labelled by semistandard Young tableaux, we get a factor 

d d d / S,"\ "''J' 



■i— 1 z— 1 j^i-|-l 



u.n 



The additional factor c„ is the dimension of JC\ , on which the state is proportional to the identity. 

Recall that /x"'" = ^^ + Ui/y/n for 1 < i < (d — 1) and /i^'" — fid — {J2i Ui)/ ^/n. li 5 > 2dn°'~^ > 
2d'n?~^/'^ then /i"'"//i"'" < 1 for all ||-u|| < rC . Moreover rm^j < n for all (i, j), so the total number 

,2 

of m's is smaller than n . Thus 

(u,n\ '"•,J 

On the other hand m = is always in the set of possible m, so that 



One can easily verify that 



^-^-rT Az!nL+i(^'~^fe + fe-0 ^ 



;^^ {Xi+d-iy. - (n + d)^'' 

The remaining factor is the multinomial law. We now show that this is the dominating part. Let us 
write (Yi, . . . , Yd) for the multinomial random variable. Then we have 

P[|y, - n/xf "I >x]< 2exp (-^) ■ (7.34) 

Indeed each Y^ is a sum of independent Bernoulli variables Xi, . . . , Xn with P{Xk = 1) = /i"'" and 
F{Xk = 0) = 1 - /if'", and by Hoeffding's inequahty [44] 



\J2^k- nXk]\ > x] < 2exp I -— J . 
fe=i ^ '^ ^ 
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By definition, for any A ^ A„^q, there exists an i such that \Xi—nfii\ > n", which imphes |Ai — n/i"'"| > 
n" — drf'^^/''^. With n°'~^~'^/'^ > 2d, the upper bound is simply n"/2 and we have 



E 11^^ 



.;, Ill - P[A ^ A„,„] < n^-Y,nY^ - nt,f"\ > n"/2] 



<2dn'''exp(-7i2"-72). 



D 



7.5 Proof of Lemma 16.11 and Lemma 16.81 on classical LAN 

We shall use multinomials as an intermediate step. Recalling that b^"' — Px"t\j we can write: 



M{u,V,)~Y.f'x" 



< 



/■" - M"„^,„ 



+ 






(7.35) 



where M", ,, ,, „ is the d-multinomial with coefficients u"'". 

Ml' ^---.Md' 

Concisely, what we really prove in this lemma is the equivalence of the following classical experiments, 
together with an explicit rate: 

>(« = |m\„ „„,\\u\\<nA, 
y. Ml y-'f^d J 

gn^Wiu,V^),\\u\\<n-'}. 

Recall that p^'" does not depend on ( and is denoted p"'". We shall use the shorthand notation 

M"'":=A/"a„ a„- 

Ml' :---,Md' 



We ffi'st bound the ffi'st term on the left side of (|7.35p as follows: 

„-l/2+7 I „a-l 

sup \\p"^" - M"^"||^ < C . 

||S||<ri-' 

To show this, we rewrite: 

||p«," _ M"'"||^ = J2 K'" - M"'"(A)| 

|A|=« 



(7.36) 



< Yl \pr-M"'^{x)\+ Y. [pT+m-^^x] 



AeA„.a A^A„ 

Lemma [O and (TTM)) imply that for all ||ir|| < n^, and n > {2d/5)^ + (2(i)°-^-i/^ 

^ pr + ^^"'"(A) < Ci exp(-(C2n2"-i)), 

A^A„ 
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with Ci and C2 depending only on the dimension. We end the proof of (|7.36D by recalling that 



Px = 



n 

1=1 



(Xi + d-iy. 



u,n\ "'»>J 



ERR:. 

m6Ai<j \"j 



M"'"(A). 



Now, for all ||u|| < n'' and all A G A„ „, the right hand side without the multinomial is 



Kk 



/^; 



n 1 - ^ + oK-v^) E n t" + oin-^/^+v-^) 



mgA J<j 



Mj 



M» 



If Xi-Xi+i > (d-l)ni/2 tj^gj^ ;\^ contains aU the multiplicities m in the 'cube' {0, 1, ... , ni/2}d(d-i)/2_ 
Since /ii — /i^+i > (5, the condition holds for all A £ A„_q,, with n satisfying nS > 2dn°' . Thus 



,1/2 






Putting together yields 



n ^i-I\k=i+i^i -Xk + k-l I ^1 

(\,4-H-l\\ Z^ 11 






1=1 



{\i + d-iy. 



meXi<j \^^i 



<c- 



-1/2+7 I „"-l 



We have thus proved (|7.36p . 



The second term in (|7.35p can be treated by 'classical' (albeit technical) methods and we refer to 
[29] for the details of the proof. The result is 

sup \\Af{u, Vf,) - Af "^"11 1 < C{n-'/^+' + n-^^^+'')/S, 

WuKni 

for n-1/2+7 > cS/2 with C = C{d). Together with dLSH), and noticing that a - 1 > e - 1/2 for 
small enough e, we get the desired rate of convergence for Lemma |6. II 



From here, proving Lemma 16.81 (that is the inverse direction) is easy enough. Indeed, recall that 
^n^np9,n „ ^e.n ^^j^j ^Yia.t cr" is a contraction. Then 



a'W{u,V^)-p 



C:U.7l 



a"AA(u,F^)~a"TV 



n n C^u^n 



< 



Af{u,V^)-T''p' 



XL C,;U^n 



So that we have the same speed and conditions as those of Lemma 16.11 



D 



7.6 Proof of Lemma 16.31 on convergence to the thermal equihbrium state 

We recall that the state $ on CCR{L^{p), a) was defined in (|4.30p and is the product of a classical 
Gaussian distribution and d{d— l)/2 Gaussian states <I>,;.j of quantum harmonic oscillators, one for 
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each pair i < j. $j j are thermal equihbrium states with inverse temperature (3 = hr(/ii//ij) (cf. 
(|4.15p V The joint state $° := ^^^^^ij is then displaced to obtain $^ but Lemma 1531 is only 
concerned with <i>°. 

It is well known that thermal equilibrium states are diagonal in the number basis and in our case 

$°= E n^^^^(-) l™)H. (7.37) 

As shown in (|5.9p . a similar formula holds for the finite dimensional block states p^^'": 

(m,AK'"'"|m,A)=Cfn %r ' (7-38) 

i<j \^^i / 

where C^ is a normalisation constant, /i"'" = I-M + uij -Jn for 1 < i < (d — 1) and /i^'" = fid — 
{J2iUt)/Vn- 

However there is a caveat: although |ni, A) are eigenvectors of p^"'^, they are not orthogonal to 
each other so we cannot directly use |m, A) (m, A| as eigenprojectors in the spectral decomposition. 
However, Lemma 15.41 gives us an estimate of the error that we incur by doing just that. 

Not first that the eigenvalues of p^'"'" are labelled by the total multiplicities rrii of the index i in the 
semistandard Young tableaux : 

rrii := Ai - E "^iJ + E "*J'*' 

Let us denote by 7i({mi}) and P{{mi}) the corresponding eigenspace and respectively eigenprojec- 
tion. Then 

{rrii} i=l 

As in Lemma l5.4l we have 

Pi{m})^ ^^^^^^^X)n2s-i/s E |m,A>(m,A|+ii;({m.}) 

in:{mi} 

where the sum runs over those m with total multiplicities {rrii}. The (positive) reminder has trace 
norm 

Tr(£;({mJ)) = 0(n(9''-2)/i2^-i/3) . dim(H({m,})). 

By summing over all {rrii} we get 

1 -- '^ - 

PX = 1 + C^(9,-2)/12^-1/3 Pa' ' +Ca E i\.^^'^ ) '^({™a), 

{mi} i=l 

where p_^'"'" is the approximate state 

P°?'" := C:« E n(A^f '")"'"'' E |m-A)(m,A|. 

{mi} i— 1 m:{mi} 
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The error term has trace norm of the order 

d 
0(„(9r,-2)/12^-l/3) . ^u ^ [] (^f '")™.->.dim(W({TO J)) = 0(n(9''-2)/12^-l/3)_ 

{m,}i=l 

where we have used the normahsation of the block state p^^"'^ ■ 



In conclusion 



\Px 



0,u,n ~0,u,7i 



p°'"'"||i = 0(n(9''-')/i2ri/3). (7.39) 



The next step is to show that the block states p_^'"'" are mapped by Tx close to $". Using (|5.9p . we 
can write 

r,(pp'") = Cf^n % T,(|m,A)(m,A|). (7.40) 

mGAi<i \^^i / 

If n"^^ < (5/2 and a > 1/2 > 77, we know that all m such that |m| < n^ 'fit into' A. 
Since /if" = /i, + 0(n-i/2+7)^ ^^cn |m| < n'', 

(7^) =(S) (1 + 0(""'^'^''^V'5)). (7.41) 

For the normalisation constant we can write: 



(cfr^- E n Fir + E n^' 



|ni|<ri'Ji<j \^^i J me A:|ni|>ri'J i<j 



__, _ u^n 

A^i / mFA:linl>Ti,l j<1 \l^i 



If 2dn''' ^/^ < (5/2 then the second part is less than n'^ (1 — 6/2)"'^ which is negligible compared to 
the other error terms. Hence: 



(^a)-^- E n - (1 + 0(^-1/2+7+./^)) 

|m|<ri'J i<'J ^^'-^ 

E n(-) (1+0(^-^/^+^+7^) 



^gpjd(d-l)/2 i<j 



rr_^^^(i + o(„-i/2+7+r,/5))_ (7,42) 



We then recall that for unit vectors, we have |1|Vj)(V'| — 10) ('/'I 111 = 2-\/l — |(i/)|(/))p. So that, using 
Lemma [5.41 we get that for |m| < n'' 

||rA(|m,A)(m,A|)-|m)(m|||i = ||yA|m,A)(m,A|l^;-|m)(m|||i=0(n(9''-^)/2V(5i/6). (7.43) 

Putting the estimates (TMTI) . (mSj) . fTIS]) back into formula (mol) . wc obtain Ta(/5^'"'"), so that 
Ta(p°^"~")= ^ J-jMl_^J^J |m)(m|+0(n-i/2+7+'V,5,n(9''-2)/24/(5i/6). (7.44) 

Comparing with (|7.37p . and using (|7.39|) we get the desired result. 

D 
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7.7 Proof of Lemma 16.51 on local linearity of SU{d) 

The key is to notice that, as we are deahng with a group, there is a r such that 

U-\C + z, 0, n)[/(C, 0, n)U{z, 0, n) = [/(-(f - z, 0, n)U{C, 0, n)U{z, 0, n) = U{r, s, n), 

and similarly for the operation A. We shall prove below that under the condition that both ^ and 
z are smaller than n^ , then ||r|| + ||s|| = 0{n~^^^^^^ /S). Let us call this the domination hypothesis 
for further reference. 

Now, as the actions are unitary, we may rewrite the norm in Lemma as 



A^ 



[A. 



C+z,n 



Ai-"Af](|0,A)(0,A|) 



^-(C+.-),n[^C+.-«_^C,n^.-,n](|0,A)(0,A|) 



[Id-Ar^"](|0,A)(0,A|) 



As T\ is an isometry, we may also let it act the left and T^ on the right and get: 



A = 



T,(|o,A)(o,A|)-rAA^'^'"r;(|o)(o 



< Il|0)(0|-|r)(r1||,+ |r)(r1-T,A',^'"r;(|0)(0|) ^ + ||T,(|0, A) (0, A|) - |0> (0| ||i. 

By the domination hypothesis, the norm of r is smaller than •nT^I'^^'^^ j 6. hence (r|0) = 1 — 
0(„-i+4/3^^2)^ Using |||V')(V'I - I'/') (01 111 = 2^1 - |(7/'|0)|2 we get that the first term on the right 
side of the inequality is 0{rr^l'^^'^^ jb). Notice that this is dominated by i?(n) given in equation 
(|6TT|l since t] > 2/3. 

For the second term, we apply Lemma 16.41 with z = 0. By the domination hypothesis, ||s|| < 
j^-i/2+2/3^^^ so we may apply Lemma [6.41 and the remainder is given by R{n) in equation (|6.1ip . 

The last term is 0{n'^^'^~^'>/^^ /S^^^) as shown in (|7.43p which is dominated by R{n). 



We finish the proof of the lemma, and simultaneously that of Theorem l4.3[ by proving the domination 
hypothesis. Recall that an arbitrary element in SU{d) can be written in the exponential form 



U{r, s) := cxp 



^1 l<j<k<d 






where (r, s) e C^^'^^i^/^ x R'^-^, and T^^, Hi are the generators of SU{d) defined in dO]). A special 
case of this is U{'r) :— U{'r, 0). In general, the map (r, s) i-^ C/(r, s) is not injective but becomes so 
if we restrict to a small enough neighbourhood C of the origin (0,0) G £,'^(d--i)/2 y- jj^-i On this 
neighbourhood it makes sense to define the inverse as a sort of 'logarithm' 



\ogU{r,s) := {r,s), 



which is a C°° function. 
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By continuity of the product, ii x,y £ C'^(.'^^'^)/^ are small enough, then [/(— af— y)U{x)U{y) £ C. 
Since ||C|| + ||z||/\/n < n^~^^'^/S, we can apply this to x = C/v^i V — z/^/n for n > (C/S) i/a-f* with 
the constant C depending only on the dimension, and get 

(r7V^, s/V^) = fiC/V^, z/V^) := log [[/(-(C + z)/V^)UiC/V^)U{z/V^) 

Since / is a C°° function we can expand in Taylor series and it is easy to show that /(0,0) — 0, 
the first order partial derivatives are zero as well, and the second order derivatives are uniformly 
bounded in a neighbourhood of the origin. Thus we get 

r = V^O f 4^^, J^^) = 0(n-V2+2^/5). 
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